PCT

NOTIFICATION OF ELECTION 

(PCT Rule 61.2) 


To: 

Assistant Commissioner for Patents
United States Patent and Trademark 
Office 
Box PCT 

Washington, D.C. 20231 
ETATS-UNIS D'AMERIQUE 

in its capacity as elected Office 


Date of mailing (day/month/year) 
16 June 2000 (16.06.00)




International application No. 
PCT/FI99/00870 


Applicant's or agent's file reference
31531 


International filing date (day/month/year) 
20 October 1999 (20.10.99) 


Priority date (day/month/year) 

23 October 1998 (23.10.98) 


Applicant 

YLIHONKO, Kristiina et al 



1. The designated Office is hereby notified of its election made: 

| X | in the demand filed with the International Preliminary Examining Authority on: 

08 May 2000 (08.05.00) 



| | in a notice effecting later election filed with the International Bureau on: 



2. The election | X | was 

| | was not 

made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 



The International Bureau of WIPO
34, chemin des Colombettes
1211 Geneva 20, Switzerland

Facsimile No.: (41-22) 740.14.35

Authorized officer

Manu Berrod

Telephone No.: (41-22) 338.83.38



Form PCT/IB/331 (July 1992) FI9900870 



PCT/FI99/00870 



Applicant's or agent's file reference: 31531

International application No.: PCT/FI99/00870



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 



(PCT Rule 13bis)



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description 
on page 18, lines 17-21



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet | | 



Name of depositary institution 

Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ) 



Address of depositary institution (including postal code and country)



Mascheroder Weg 1b, D-38124 Braunschweig, Germany 



Date of deposit: 14 October 1998

Accession Number: DSM 12451, DSM 12452



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet



In respect of those designations in which a European patent is sought, a sample of the 
deposited biological material will be made available as provided in Rule 28(3) EPC, until the 
publication of the mention of the grant of the European patent or for 20 years from the date of 
filing if the application is refused or withdrawn or deemed to be withdrawn, only by the issue of 
such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



| | This sheet was received with the international application



Authorized officer



For International Bureau use only 



| | This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1998)



NATIONAL BOARD OF PATENTS AND REGISTRATION (PATENTTI- JA REKISTERIHALLITUS)

Patents and Innovations Line



SEARCH REPORT



APPLICATION NUMBER

982295


CLASSIFICATION

C 12 N 15/31, 15/52, 15/76, C 07 H 15/252, C 12 P 19/56, C 12 N 1/21
// (C 12 N 15/31, C 12 R 1:465)

□ continued on reverse side



MATERIAL SEARCHED

Patent office publications

FI, SE, NO, DK, DE, CH, EP, WO, GB, US:

FI, SE, NO, DK: publications from the Office's publication collection in classes C 12 N 15/31, 15/52, 15/76,
C 07 H 15/252, C 12 P 19/56

[illegible] Index, EPO Documentation, Patent Abstracts of Japan, US full text (Epoque)

□ continued on reverse side

Other material

From the databases CAplus, Biosis, Medline, Registry (STN)

□ continued on reverse side



CITED PUBLICATIONS

Category*)   Identification of publication                               Relates to claims

A            Ylihonko, K. et al., WO 96/10581, C 07 K 14/36               1-15
A            Ylihonko, K. et al., Microbiology 142 (1996) 1965-1972       1-15
A            Ylihonko, K. et al., Mol. Gen. Genet. 251 (1996) 113-120     1-15
A            Torkkell, S. et al., Mol. Gen. Genet. 256 (1997) 203-209     1-15

□ continued on reverse side


*) X   Publication relevant to patentability when considered alone
   Y   Publication relevant to patentability when this publication and one or more other
       publications of the same category are taken into account
   A   Publication representing the general state of the art, but not a bar to patentability


Date


Examiner

PATENT COOPERATION TREATY

PCT 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference 
31531 


FOR FURTHER ACTION: See Notification of Transmittal of International
Preliminary Examination Report (Form PCT/IPEA/416)


International application No. 
PCT/FI99/00870 


International filing date (day/month/year)
20.10.1999 


Priority date (day/month/year) 
23.10.1998 


International Patent Classification (IPC) or national classification and IPC7 
C 07 K 14/36, C 12 N 15/31, C 12 P 19/56 


Applicant 

Galilaeus Oy et al



1. This international preliminary examination report has been prepared by this International Preliminary Examining
Authority and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 



sheets, including this cover sheet 



| | This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 



These annexes consist of a total of 



sheets. 



This report contains indications relating to the following items:

I     | X |  Basis of the report
II    | |
III   | |
IV    | |
V            Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability;
             citations and explanations supporting such statement
VI    | |    Certain documents cited
VII   | |
VIII  | |



Date of submission of the demand 

08.05.2000 


Date of completion of this report 
12.01.2001 


Name and mailing address of the IPEA/SE 
Patent- och registreringsverket
Box 5055
S-102 42 STOCKHOLM
Telex 17978 PATOREG-S

Facsimile No. 08-667 72 88 


Authorized officer 

Patrick Andersson/EO 
Telephone No. 08-782 25 00 






INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No.
PCT/FI99/00870 



I. Basis of the report 



1. With regard to the elements of the international application:*

| X | the international application as originally filed

| | the description:
      pages            , as originally filed
      pages            , filed with the demand
      pages            , filed with the letter of

| | the claims:
      pages            , as originally filed
      pages            , as amended (together with any statement) under Article 19
      pages            , filed with the demand
      pages            , filed with the letter of

| | the drawings:
      pages            , as originally filed
      pages            , filed with the demand
      pages            , filed with the letter of

| | the sequence listing part of the description:
      pages            , as originally filed
      pages            , filed with the demand
      pages            , filed with the letter of

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language which is: 

| | the language of a translation furnished for the purposes of international search (under Rule 23.1(b)).
| | the language of publication of the international application (under Rule 48.3(b)).
| | the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 and/or
55.3).

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international
preliminary examination was carried out on the basis of the sequence listing:

| | contained in the international application in written form.

| | filed together with the international application in computer readable form.

| | furnished subsequently to this Authority in written form.

| | furnished subsequently to this Authority in computer readable form.

| | The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the
international application as filed has been furnished.

| | The statement that the information recorded in computer readable form is identical to the written sequence listing has
been furnished.

4. | | The amendments have resulted in the cancellation of:

| | the description, pages

| | the claims, Nos.

| | the drawings, sheet/fig



5. | | This report has been established as if (some of) the amendments had not been made, since they have been considered to go
beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)).**

* Replacement sheets which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to
in this report as "originally filed" and are not annexed to this report since they do not contain amendments (Rules 70.16
and 70.17).

** Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this report.



Form PCT/IPEA/409 (Box I) (January 1998) 






INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No.
PCT/FI99/00870 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



1. Statement

Novelty (N)                      Claims 1-15     YES
                                 Claims          NO

Inventive step (IS)              Claims 1-15     YES
                                 Claims          NO

Industrial applicability (IA)    Claims 1-15     YES
                                 Claims          NO



2. Citations and explanations (Rule 70.7) 



The claimed invention relates to a gene cluster for the
anthracycline biosynthetic pathway of Streptomyces nogalater
comprising one 10 kb fragment and one 7 kb flanked BglII
fragment of the S. nogalater genome. The cluster, or parts
thereof, can be used for synthesis of hybrid antibiotics.

The following document is considered relevant:

D1) Torkkell S et al., "Characterization of Streptomyces
nogalater genes encoding enzymes involved in the glycosylation
steps in nogalamycin biosynthesis", Mol Gen Genet, 1997, vol
256, pages 203-209.

D1 discloses the characterization of S. nogalater genes
involved in the glycosylation steps in nogalamycin
biosynthesis. The genes are studied by inserting parts of a
gene cluster into a strain of S. galilaeus (H039), thereby
producing hybrid compounds. A sequence similarity search with
the sequence for the presently claimed snoaL gene indicates
that snoaL is known as a sub-sequence of sequence AF187532
related to D1.

In relation to the subject matter of the other claims, D1 is
considered to represent the general state of the art and has
no particular relevance. Thus, the invention according to
claims 1-15 is novel, inventive and industrially applicable.

PATENT COOPERATION TREATY

From the INTERNATIONAL BUREAU 



PCT 



NOTIFICATION CONCERNING 
SUBMISSION OR TRANSMITTAL 
OF PRIORITY DOCUMENT 

(PCT Administrative Instructions, Section 411) 



To: 



OY JALO ANT-WUORINEN AB 
Iso Roobertinkatu 4-6 A 
FIN-00120 Helsinki 
FINLANDE 



Date of mailing (day/month/year) 

25 November 1999 (25.11.99) 






Applicant's or agent's file reference 
31531 


IMPORTANT NOTIFICATION 


International application No. 
PCT/FI99/00870 


International filing date (day/month/year) 

20 October 1999 (20.10.99) 


International publication date (day/month/year) 

Not yet published 


Priority date (day/month/year) 

23 October 1998 (23.10.98)


Applicant 

GALILAEUS OY et al



1. The applicant is hereby notified of the date of receipt (except where the letters "NR" appear in the right-hand column) by the
International Bureau of the priority document(s) relating to the earlier application(s) indicated below. Unless otherwise
indicated by an asterisk appearing next to a date of receipt, or by the letters "NR", in the right-hand column, the priority
document concerned was submitted or transmitted to the International Bureau in compliance with Rule 17.1(a) or (b).

2. This updates and replaces any previously issued notification concerning submission or transmittal of priority documents.

3. An asterisk (*) appearing next to a date of receipt, in the right-hand column, denotes a priority document submitted
or transmitted to the International Bureau but not in compliance with Rule 17.1(a) or (b). In such a case, the attention
of the applicant is directed to Rule 17.1(c) which provides that no designated Office may disregard the priority claim
concerned before giving the applicant an opportunity, upon entry into the national phase, to furnish the priority document
within a time limit which is reasonable under the circumstances.

4. The letters "NR" appearing in the right-hand column denote a priority document which was not received by the International
Bureau or which the applicant did not request the receiving Office to prepare and transmit to the International Bureau,
as provided by Rule 17.1(a) or (b), respectively. In such a case, the attention of the applicant is directed to Rule 17.1(c) which
provides that no designated Office may disregard the priority claim concerned before giving the applicant an opportunity,
upon entry into the national phase, to furnish the priority document within a time limit which is reasonable under the
circumstances.



Priority date                 Priority application No.   Country or regional Office    Date of receipt
                                                         or PCT receiving Office       of priority document

23 October 1998 (23.10.98)    982295                     FI                            16 November 1999 (16.11.99)



The International Bureau of WIPO 
34, chemin des Colombettes 
1211 Geneva 20, Switzerland



Facsimile No. (41-22) 740.14.35 



Authorized officer 

Catherine Massetti 

Telephone No. (41-22) 338.83.38 




Form PCT/IB/304 (July 1998) 



2976481

From the INTERNATIONAL BUREAU



WO 00/24775 
PCT/FI99/00870 

12.05.2000



PCT 



NOTICE INFORMING THE APPLICANT OF THE 
COMMUNICATION OF THE INTERNATIONAL 
APPLICATION TO THE DESIGNATED OFFICES 

(PCT Rule 47.1(c), first sentence) 



To: 



OY JALO ANT-WUORINEN AB 
Iso Roobertinkatu 4-6 A 
FIN-00120 Helsinki 
FINLANDE 



Date of mailing (day/month/year)

04 May 2000 (04.05.00) 






Applicant's or agent's file reference 
31531 


IMPORTANT NOTICE 


International application No. 
PCT/FI99/00870 


International filing date (day/month/year) 
20 October 1999 (20.10.99)


Priority date (day/month/year) 

23 October 1998 (23.10.98) 


Applicant 

GALILAEUS OY et al 



1. Notice is hereby given that the International Bureau has communicated, as provided in Article 20, the international application
to the following designated Offices on the date indicated above as the date of mailing of this Notice: 

JP,US 



In accordance with Rule 47.1 (c), third sentence, those Offices will accept the present Notice as conclusive evidence that 
the communication of the international application has duly taken place on the date of mailing indicated above and no copy 
of the international application is required to be furnished by the applicant to the designated Office(s). 

2. The following designated Offices have waived the requirement for such a communication at this time: 
EP 



The communication will be made to those Offices only upon their request. Furthermore, those Offices do not require the 
applicant to furnish a copy of the international application (Rule 49.1 (a-bis)). 

3. Enclosed with this Notice is a copy of the international application as published by the International Bureau on 
04 May 2000 (04.05.00) under No. WO 00/24775 



REMINDER REGARDING CHAPTER II (Article 31(2)(a) and Rule 54.2)

If the applicant wishes to postpone entry into the national phase until 30 months (or later in some Offices) from the priority 
date, a demand for international preliminary examination must be filed with the competent International Preliminary 
Examining Authority before the expiration of 19 months from the priority date. 

It is the applicant's sole responsibility to monitor the 19-month time limit.

Note that only an applicant who is a national or resident of a PCT Contracting State which is bound by Chapter II has the 
right to file a demand for international preliminary examination. 

REMINDER REGARDING ENTRY INTO THE NATIONAL PHASE (Article 22 or 39(1)) 

If the applicant wishes to proceed with the international application in the national phase, he must, within 20 months 
or 30 months, or later in some Offices, perform the acts referred to therein before each designated or elected Office. 

For further important information on the time limits and acts to be performed for entering the national phase, see the 
Annex to Form PCT/IB/301 (Notification of Receipt of Record Copy) and Volume II of the PCT Applicant's Guide. 





The International Bureau of WIPO 
34, chemin des Colombettes 
1211 Geneva 20, Switzerland 

Facsimile No. (41-22) 740.14.35 


Authorized officer 

J. Zahra 

Telephone No. (41-22) 338.83.38 



Form PCT/IB/308 (July 1996) 



3247937 



Continuation of Form PCT/IB/308 

NOTICE INFORMING THE APPLICANT OF THE COMMUNICATION OF 
THE INTERNATIONAL APPLICATION TO THE DESIGNATED OFFICES 



WO 00/24775 
PCT/FI99/00870 



Date of mailing (day/month/year) 
04 May 2000 (04.05.00) 


IMPORTANT NOTICE 


Applicant's or agent's file reference 
31531 


International application No. 

PCT/FI99/00870 



The applicant is hereby notified that at the time of establishment of this Notice, the time limit under Rule 46.1 for making 
amendments under Article 19 has not yet expired and the International Bureau had received neither such amendments nor a 
declaration that the applicant does not wish to make amendments. 



Form PCT/IB/308 (continuation sheet) (July 1996)

INFORMATION CONCERNING ELECTED
OFFICES NOTIFIED OF THEIR ELECTION 

(PCT Rule 61.3) 



To: 



OY JALO ANT-WUORINEN AB 
Iso Roobertinkatu 4-6 A 
FIN-00120 Helsinki 
FINLANDE 



Date of mailing (day/month/year) 
16 June 2000 (16.06.00)






Applicant's or agent's file reference 
31531 


IMPORTANT INFORMATION 


International application No. 
PCT/FI99/00870 


International filing date (day/month/year) 

20 October 1999 (20.10.99) 


Priority date (day/month/year) 

23 October 1998 (23.10.98) 


Applicant 

GALILAEUS OY et al 



1. The applicant is hereby informed that the International Bureau has, according to Article 31(7), notified each of the following 
Offices of its election: 



EP: AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE
National: JP, US



2. The following Offices have waived the requirement for the notification of their election; the notification will be sent to them
by the International Bureau only upon their request:

None

3. The applicant is reminded that he must enter the "national phase" before the expiration of 30 months from the priority date
before each of the Offices listed above. This must be done by paying the national fee(s) and furnishing, if prescribed, a
translation of the international application (Article 39(1)(a)), as well as, where applicable, by furnishing a translation of any
annexes of the international preliminary examination report (Article 36(3)(b) and Rule 74.1).

Some offices have fixed time limits expiring later than the above-mentioned time limit. For detailed information about the
applicable time limits and the acts to be performed upon entry into the national phase before a particular Office, see Volume I
of the PCT Applicant's Guide.

The entry into the European regional phase is postponed until 31 months from the priority date for all States designated for 
the purposes of obtaining a European patent. 
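
The time limits quoted in these notices (19, 30 and 31 months from the priority date) can be worked out from the dates given in the file. The following is only a minimal sketch: the helper `add_months` is a hypothetical name introduced for illustration, it assumes that a period in months expires on the same day number of the target month (or on the last day of that month if there is no such day), and it ignores any shift for non-working days or office-specific rules.

```python
import calendar
from datetime import date


def add_months(start, months):
    """Return the date `months` months after `start`.

    Assumption: same day number in the target month, clamped to the
    last day of that month when the day does not exist there.
    """
    index = start.year * 12 + (start.month - 1) + months
    year, month0 = divmod(index, 12)
    last_day = calendar.monthrange(year, month0 + 1)[1]
    return date(year, month0 + 1, min(start.day, last_day))


# Dates taken from the notices in this file.
priority_date = date(1998, 10, 23)
demand_filed = date(2000, 5, 8)

chapter_ii_limit = add_months(priority_date, 19)      # limit for filing the demand
national_phase_limit = add_months(priority_date, 30)  # national phase (elected Offices)
european_phase_limit = add_months(priority_date, 31)  # European regional phase

# True when the demand was filed within the 19-month limit.
demand_in_time = demand_filed <= chapter_ii_limit

print(chapter_ii_limit, national_phase_limit, european_phase_limit, demand_in_time)
```

Under these assumptions the demand of 08 May 2000 falls before the 19-month limit, which agrees with the box marked "was" in the notification of election.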



The International Bureau of WIPO 


Authorized officer:
Manu Berrod

(11) International Publication Number: WO 00/24775
(43) International Publication Date: 4 May 2000 (04.05.00)



(21) International Application Number: PCT/FI99/00870

(22) International Filing Date: 20 October 1999 (20.10.99) 



(30) Priority Data:

982295          23 October 1998 (23.10.98)          FI



(71) Applicant (for all designated States except US): GALILAEUS

(72) Inventors; and

With international search report.

(54) Title: GENE CLUSTER INVOLVED IN NOGALAMYCIN BIOSYNTHESIS, AND ITS USE IN PRODUCTION OF HYBRID
ANTIBIOTICS 



(57) Abstract 



The present invention relates to the gene cluster for nogalamycin biosynthesis derived from Streptomyces nogalater, and the use of 
the genes therein to obtain novel hybrid antibiotics for drug screening.

Gene cluster involved in nogalamycin biosynthesis, and its use in production of
hybrid antibiotics 

Field of the invention 


This invention relates to the gene cluster for nogalamycin biosynthesis derived from 
Streptomyces nogalater, and the use of the genes therein to obtain novel hybrid 
antibiotics for drug screening. 

Background of the invention

Anthracyclines are antitumor antibiotics, mainly produced by Streptomyces sp. The daunomycin
family of anthracyclines is commercially the most important, since almost all of the
roughly ten anthracyclines currently in clinical use, or in late clinical trials as cytotoxic
drugs, belong to this family. Despite the long history of anthracyclines, three decades or
so, studies on their biosynthesis are still going on, and there is continuing interest in
obtaining novel molecules for the development of cancer chemotherapeutics. A method
currently used for finding novel molecules for drug screening is genetic engineering.
Cloning the genes for anthracycline biosynthesis facilitates the production of hybrid
anthracyclines, as well as their use in combinatorial biosynthesis to generate novel
molecules.

Nogalamycin, which was first described by Bhuyan and Dietz in 1965, is an anthracycline
antibiotic produced by Streptomyces nogalater. It is highly active against tumor
cells, but the toxic properties of this compound have prevented its progress to clinical
trials (Bhuyan and Smith, 1975). However, menogaril (7-O-methylnogarol) is a
semisynthetic derivative of nogalamycin, and its value in the treatment of cancer has
been studied (e.g. Yoshida et al., 1996), the interest being now mainly in Japan.
Structurally, nogalamycin (Fig. 1) differs from most other anthracyclines, e.g. from
the daunomycin family, in two noteworthy features: (i) the stereochemistry at position
nine is opposite, and (ii) it has a sugar moiety in which nogalamine is attached at
position 1 by a typical glycosidic bond, but is also attached to carbon 2 by an
extraordinary C-C bond. Structural elucidation of nogalamycin was reported by Wiley
et al. (1977). Furthermore, biosynthetic studies of nogalamycin published by
Wiley et al. in 1978 give information on the building blocks: the aglycone moiety is
built from ten acetates; the neutral sugar, nogalose, is derived from glucose; and the methyl
groups of both sugars, nogalamine and nogalose, are transferred from methionine.
The origin of nogalamine was not clearly resolved by Wiley, but most probably
nogalamine is also derived from glucose.

Molecular cloning of biosynthesis genes for anthracyclines has facilitated studies on
molecular genetics, providing tools for rational modifications of the structures, and
also for surprising combinations with other antibiotics. Most of the interest has focused
on daunomycin biosynthesis genes, as reported in several publications (Lomovskaya et
al., 1998; Rajgarhia and Strohl, 1997 and references therein). Some genes for
aclacinomycin biosynthesis from S. galilaeus (Fujii and Ebizuka, 1997) and for rhodomycin
biosynthesis from S. purpurascens (Niemi et al., 1994) have been cloned as well. We
have cloned the biosynthesis genes for nogalamycin, and successfully used the genes for
producing hybrid anthracyclines. Most of the genes are involved in the polyketide pathway,
being responsible for the formation of a tricyclic intermediate, and they are reported in
Ylihonko et al., 1996a and b, and by Torkkell et al., 1997. Despite the advances in
molecular cloning, the biosynthetic pathway from glucose to the sugars found in
anthracyclines is still mainly hypothetical.

Regarding the genes for the deoxyhexose pathway, Madduri et al. (1998) have reported that
a gene derived from the avermectin biosynthesis cluster caused the production of hybrid
anthracyclines with an altered sugar moiety when transferred into an S. peucetius mutant.
The product obtained was epirubicin, a commercially important anthracycline. In this
case a hydroxy group in the daunosamine moiety was in the opposite stereochemistry
due to the action of an avermectin biosynthesis gene. S. galilaeus has been used as the
host to prepare hybrid anthracyclines using genes derived from the rhodomycin pathway
of S. purpurascens (Niemi et al., 1994), and from the nogalamycin biosynthesis cluster
of S. nogalater (Ylihonko et al., 1996a). The genes for the nogalamycin pathway were
used to generate hybrid anthracycline production in S. steffisburgensis, which
typically produces steffimycin (Kunnari et al., 1997). Previously, biosynthesis genes for
actinorhodin have been expressed in S. galilaeus, resulting in the formation of aloesaponarin
(Strohl et al., 1991). These hybrid compounds were modified in the aglycone moiety.

Summary of the invention

The present invention concerns a gene cluster of Streptomyces nogalater, most of the
genes of the cluster being derived from the deoxyhexose pathway for nogalamine and
nogalose. When a DNA fragment of the said region is expressed in S. galilaeus, which produces
aclacinomycins, hybrid anthracyclines are obtained, wherein the aglycone moiety is
derived from S. galilaeus, whereas the sugar moiety is characteristic neither of S.
nogalater nor of S. galilaeus. Furthermore, when the gene included in said
cluster that encodes a cyclase for nogalamycin is inserted into a suitable plasmid construction,
nogalamycinone, the aglycone of nogalamycin, is obtained. Since the
stereochemistry of nogalamycin differs from most other anthracyclines, using this gene
enables the preparation of C-9 stereoisomers of the anthracycline molecules.

Detailed description of the invention 

The experimental procedures of the present invention are methods conventional in the
art. The techniques not described in detail here are given in the manuals by Hopwood et
al. "Genetic manipulation of Streptomyces: a laboratory manual" The John Innes
Foundation, Norwich (1985) and by Sambrook et al. (1989) "Molecular cloning: a
laboratory manual". The publications, patents and patent applications cited herein are
given in the reference list in their entirety.

The present invention concerns particularly the gene cluster for nogalamycin
biosynthesis (5no5-cluster) causing the production of hybrid antibiotics with
modifications in the sugar moiety. The invention concerns specifically the use of the genes for
nogalamine/nogalose biosynthesis to generate hybrid antibiotics modified in sugar
moieties. The invention also concerns the use of a specific cyclase gene included in the
gene cluster of the invention, to generate the C-9 stereoisomers of typical
anthracyclines.

The gene cluster according to the present invention is linked to the earlier reported
clusters for nogalamycin biosynthesis. The starting point of the present invention was
the gene cluster for the nogalamycin chromophore (International Patent Application WO
96/10581). Subsequently, we have found some genes for the deoxyhexose pathway of
nogalamycin biosynthesis (Torkkell et al., 1997), and a part of the fragment comprising
said genes was used to clone the genes for this invention.



The biosynthesis genes for nogalamycin can be isolated from Streptomyces sp.,
particularly from S. nogalater, which produces nogalamycin. Species which produce
nogalamycin-like anthracyclines can also be used, e.g. S. violaceochromogenes producing
arugomycin (Kawai et al., 1987), or S. avidinii producing avidinorubicin (Aoki et al.,
1991).

Genomic DNA of a Streptomyces strain carrying the genes for nogalamycin biosynthesis
is used in preparing a genomic library. Suitable gene fragments for cloning may be
obtained with any frequently cutting restriction enzyme. Typically Sau3AI is used. The
isolated fragments can be inserted by ligation into any Escherichia coli vector such as a
plasmid, a phagemid, a phage, or a cosmid. A cosmid vector is preferred since it
enables the cloning of large DNA fragments. A cosmid vector such as pFD666 (ATCC
No. 77286) is suitable for this purpose, as it enables cloning of fragments of about
40 kb. The BamHI site of pFD666, which gives sticky ends compatible with the Sau3AI
fragments, may be used for cloning. Commercially available kits may be used to pack the
DNA in phage particles. Various E. coli strains can be used for infection with the packed
DNA. An appropriate E. coli strain is, e.g., XL1-Blue MRF', which is deficient in several
restriction systems.

Using E. coli as a host strain for the genomic library, hybridization is an advantageous
screening strategy. The probe for hybridization may be any known fragment derived
from the nogalamycin gene cluster, but a short fragment of about 1 kb derived from one
end of the biosynthetic region previously cloned is preferred. Colonies of the genomic
library are transferred for filter hybridization to membranes, preferably to nylon
membranes. Since the average size of a genomic DNA fragment is 40 kb, 2300
colonies give a 99.99% probability of finding the expanded region for nogalamycin
biosynthesis. Any method for hybridization may be used but, in particular, the DIG
System (Boehringer Mannheim GmbH, Germany) is useful. Since the probe is
homologous to the hybridized DNA, it is preferable to carry out the stringent washes of
hybridization at 70°C in a low salt concentration according to Boehringer Mannheim's
manual "DIG System User's Guide for Filter Hybridization". At least 80% homology is
suggested to be needed for a DNA fragment to bind a probe in the conditions used for
washes.
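
The colony count quoted above is consistent with the standard library-coverage estimate (often attributed to Clarke and Carbon), N = ln(1 - P) / ln(1 - f), where P is the desired probability and f is the fraction of the genome carried by one clone. The sketch below is illustrative only: the text does not state a genome size, so the value of 10 Mb is an assumption chosen for the example, and the function names are hypothetical.

```python
import math


def clones_needed(probability, insert_kb, genome_kb):
    """Number of clones for a given probability of covering any one locus."""
    fraction = insert_kb / genome_kb  # share of the genome in a single clone
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))


def coverage_probability(n_clones, insert_kb, genome_kb):
    """Probability that a given locus is present in a library of n_clones."""
    fraction = insert_kb / genome_kb
    return 1.0 - (1.0 - fraction) ** n_clones


INSERT_KB = 40              # average insert size stated in the text
ASSUMED_GENOME_KB = 10_000  # assumption: 10 Mb genome, not given in the text

n = clones_needed(0.9999, INSERT_KB, ASSUMED_GENOME_KB)
p = coverage_probability(2300, INSERT_KB, ASSUMED_GENOME_KB)

print(n, p)
```

With the assumed 10 Mb genome the estimate lands close to the 2300 colonies mentioned in the text; a smaller assumed genome would require fewer clones for the same probability.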

Using this protocol, seven clones out of about 5000 gave positive signals, and were
picked for DNA isolation. Restriction mapping is an appropriate technique for
characterizing the clones. The positive clones may be digested with convenient
restriction enzymes to establish the physical linkage map of the DNA fragments. The
cosmid used for cloning was a shuttle cosmid replicating in both E. coli and
Streptomyces sp. However, the transfer of the recombinant cosmids into S. lividans TK24,
a laboratory strain typically used in Streptomyces cloning, resulted in deletions,
and was omitted. Instead, we used the plasmid pIJ486, a high copy number Streptomyces
plasmid, in the expression studies. However, any plasmid able to replicate stably
in Streptomyces may be used for this purpose.

Two BglII fragments of one of the clones were separately inserted into pIJ486 vectors,
and the two plasmids obtained were transferred into a primary host, S. lividans TK24.
The recombinant plasmids obtained (pSY42 and pSY43), containing a 10 kb and a 7 kb
fragment from S. nogalater genomic DNA, respectively, were isolated from the primary
host and further introduced into other Streptomyces strains by protoplast transformation.
The recombinant plasmid containing the 10 kb fragment caused the production of hybrid
anthracyclines in the S. galilaeus mutant strain H039, which endogenously produces
aklavinone-rhodinose-rhodinose-rhodinose. A few other S. galilaeus strains (H075,
H026, H063) mutated in the deoxyhexose pathway for the sugars of aclacinomycin were
used in transformation, and new hybrid compounds were obtained. Since the structure of
nogalamycin is almost unique among anthracyclines, the plasmids could be transferred
to other anthracycline-producing strains, such as S. peucetius, which produces
daunomycin, and S. purpurascens, which produces rhodomycins, to modify the structures of
the characteristic antibiotics.

As the cloned cluster was linked to the already known nogalamycin biosynthesis region, its
ability to modify the sugar moiety suggested the presence of genes
for the deoxyhexose pathway. However, sequencing is necessary to deduce the function of
the genes in the cloned cluster. The DNA fragments of 10 kb and 7 kb were further
inserted into the plasmid pSL1190 for subcloning. Sequencing strategies such as a
deletion set of the DNA fragments, shotgun cloning or primer walking could be used,
but we prefer to use restriction fragments for subcloning. Using the ABI PRISM system
(Perkin-Elmer) for sequencing, it is possible to obtain 500 to 700 bases per reaction,
which means that fragments of about 1 kb sharing overlapping bases are needed for
sequencing. For this purpose, 27 subclones were constructed.

Sequencing of the flanking BglII fragments, consisting of about 16000 bp, revealed 15
complete ORFs. The sequence analysis can be made with any computer program,
such as the GCG package (Madison, Wisconsin, USA). According to the present invention,
the putative gene functions, as deduced from sequence homology to the sequences available
in the data banks, are





    aminotransferase (snogI), not completed
1.  dTDP-glucose synthase (snogJ)
2.  aminomethyl transferase (snogA)
3.  polyketide cyclase (snoaM)
4.  a gene of deoxyhexose pathway, unknown (snogN)
5.  hydroxylase (snoaG)
6.  dTDP-4-dehydrorhamnose reductase (snogC)
7.  dTDP-glucose 4,6-dehydratase (snogK)
8.  NAME cyclase (snoaL)
9.  unknown (snoK)
10. glycosyl transferase, GTF (snogD)
11. unknown (snoW)
12. glycosyl transferase, GTF (snogE)
13. unknown (snoL)
14. unknown (snoO)
15. C-7 ketoreductase (snoaF)
    unknown (snoN), not completed

Gene designations: g means that the gene is involved in the biosynthesis of the glycosidic
portion, including glycosyl transferases, whereas a indicates that the gene is needed
for the formation of the aglycone moiety.

Considering the proposed biosynthetic pathway for nogalamycin shown in Fig. 3, we are
able to cause several changes in the structures of antibiotics with the genes identified,
including snoaL, responsible for the cyclization of the fourth ring of the aglycone
moiety while determining the stereochemistry of the anthracyclinone, the genes
affecting the formation of nogalamine and nogalose (snogJ, snogK, snogN, snogC,
snogA), and, in addition, the genes responsible for joining the sugar residues to the
aglycone moiety (snogD and snogE).

These genes could be separately inserted in a vector using suitable restriction sites, or
by amplifying the genes by PCR. The fragments may contain an intrinsic promoter, or a
promoter may be separately cloned. It is advantageous to use a vector carrying a
promoter to allow expression of the genes in a Streptomyces strain. The plasmid
pIJE486 contains ermE, the promoter of the erythromycin resistance gene, allowing
constitutive expression of genes inserted in the correct orientation. Special attention is drawn
to the gene encoding a cyclase for the aliphatic ring, but any gene of said cluster may
be expressed in Streptomyces hosts. The said cyclase converts the stereochemistry at C9
of auramycinone in TK24, if inserted into a plasmid possessing the other genes for
auramycinone biosynthesis, except the cyclase responsible for the typical
stereochemistry of anthracyclines.

Streptomyces strains, in particular S. galilaeus, carrying the recombinant plasmids are
cultivated in media wherein antibiotics are produced. The hybrid compounds are
extracted with organic solvents from the culture broth, and the compounds are separated
and purified using chromatographic techniques.

According to this invention, S. galilaeus H039 carrying the plasmid pSY42, designated
as H039/pSY42, produces aklavinone-4'-epi-2-deoxyfucose in E1 medium
supplemented with thiostrepton to give selection pressure for the plasmid-containing
strains.

S. lividans TK24 carrying the plasmid pSY15c, which contains the genes for the nogalamycin
chromophore and the genes for a cyclase (snoaL) and a ketoreductase (snoaF), was
cultivated in E1 medium supplemented with thiostrepton. The compound
9-epi-auramycinone was produced, and this structure is now called nogalamycinone. Any
DNA fragment of the invention subcloned from the 17 kb nogalamycin biosynthesis
region can be inserted in a vector replicating in Streptomyces, and the products may be
produced by fermentation of the plasmid-containing strains.

Brief description of the drawings 

Fig. 1 shows the structures of nogalamycin, daunomycin and aclacinomycin.

Fig. 2 is a diagram of the gene cluster (Sno5) of the invention for nogalamycin
biosynthesis.

Fig. 3 describes the proposed biosynthesis pathway for nogalamycin.

Fig. 4 shows a diagram of the plasmid pSY15c. The genes snoaL (aL) and snoaF
(aF), shown in black, are inserted in the plasmid pSY15 to give pSY15c. aL
represents the cyclase snoaL and aF the C-7 ketoreductase snoaF. pSY15
(WO 96/10581) generates the production of a tricyclic intermediate of
nogalamycin biosynthesis in S. lividans. The abbreviations a1, a2 and a3 refer to the
genes snoa1, snoa2 and snoa3, respectively, for the minimal PKS. rA is the snorA
gene for an activator, aB is the snoaB gene for oxygenase, aC is the snoaC
gene for methylase, aD is the snoaD gene for polyketide ketoreductase and aE
is the snoaE gene for aromatase. gF (the snogF gene) and gG (the snogG gene),
involved in the deoxyhexose pathway, are not functional in the construct. aph
is an aminoglycoside phosphotransferase gene, and tsr is a thiostrepton
resistance gene.

Examples to further illustrate the invention are given hereafter. 


EXPERIMENTAL 
Materials used 

Restriction enzymes were purchased from Promega (Madison, Wisconsin, USA) or
Boehringer Mannheim (Germany), and alkaline phosphatase from Boehringer Mannheim,
and were used according to the manufacturers' instructions. Proteinase K was purchased
from Promega and lysozyme from Sigma (St. Louis, USA). Hybond™-N nylon
membranes used in hybridization were purchased from Amersham (Buckinghamshire,
England), and the DIG DNA Labelling Kit and DIG Luminescent Detection Kit from
Boehringer Mannheim. The Qiaquick Gel Extraction Kit from Qiagen (Hilden, Germany)
was used for isolating DNA from agarose.

Bacterial strains and their use 

- Escherichia coli XL1 Blue MRF' (Stratagene, La Jolla, CA) was used for cloning.
- Streptomyces nogalater ATCC 27451; the gene cluster of nogalamycin biosynthesis
  was cloned from this strain.

The host strains used to express the cloned genes were:

- Streptomyces lividans TK24, also used as a primary host to clone DNA propagated in
  E. coli. The strain was provided by Prof. Sir David Hopwood, John Innes Centre, UK.
- Streptomyces galilaeus H039, produces aklavinone-rhodinose-rhodinose-rhodinose
- Streptomyces galilaeus H026, produces aclacinomycin N, ACMN
  (aklavinone-rhodosamine-2-deoxyfucose-rhodinose)
- Streptomyces galilaeus H063, produces aklavinone
- Streptomyces galilaeus H075, produces
  aklavinone-rhodosamine-2-deoxyfucose-2-deoxyfucose

The detailed description of the mutants H039 and H026 is given in Ylihonko et al.
(1994) and of H075 in the FI patent application No. 981062 (Ylihonko et al., 1998).
H063 has not been described in the literature; it was obtained by NTG mutagenesis
of S. galilaeus and selected as a host strain for hybrid compound
production, as it accumulates aklavinone without any sugar residues.

Plasmids

E. coli - Streptomyces shuttle cosmid pFD666 (ATCC 77286) was used for cloning the
chromosomal DNA. E. coli cloning vectors pSL1190 (Pharmacia) and pUC19 were used
for preparing the subclones.

pIJ486 is a high copy number plasmid vector provided by Prof. Sir David Hopwood, John
Innes Centre, UK (Ward et al., 1986).

pIJE486 is a vector containing the ermE gene in the polylinker of pIJ486 (Bibb et al.,
1985).

pSY15 is a pIJ486-based plasmid construct, in which the genes of the polyketide pathway
for nogalamycin biosynthesis were cloned (Ylihonko et al., 1996a).

Nutrient media and solutions

TSB medium was used for cultivation of S. nogalater for total DNA isolation.
Lysozyme solution (0.3 M sucrose, 25 mM Tris, pH 8, and 25 mM EDTA, pH 8) was
used in the isolation of total DNA. TE buffer (10 mM Tris, pH 8.0, and 1 mM EDTA) was
used to dissolve the DNA.

TRYPTONE-SOYA BROTH (TSB)

Per litre: Oxoid Tryptone Soya Broth powder 30 g.

ISP4

Bacto ISP-medium 4, Difco; 37 g/l.

E1

Per litre in tap water:

glucose           20 g
soluble starch    20 g
Farmamedia        5 g
yeast extract     2.5 g
K2HPO4·3H2O       1.3 g
MgSO4·7H2O        1 g
NaCl              3 g
CaCO3             3 g

pH adjusted to 7.4 before autoclaving
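
The per-litre E1 recipe above can be scaled to the working volumes mentioned later in the text (60 ml shake-flask cultures and a 13 l fermentor). The following is only a convenience sketch; the dictionary and function names are illustrative and not part of the original protocol.

```python
# E1 medium, grams per litre of tap water, as listed in the recipe above.
E1_PER_LITRE_G = {
    "glucose": 20.0,
    "soluble starch": 20.0,
    "Farmamedia": 5.0,
    "yeast extract": 2.5,
    "K2HPO4·3H2O": 1.3,
    "MgSO4·7H2O": 1.0,
    "NaCl": 3.0,
    "CaCO3": 3.0,
}

def scale_recipe(per_litre_g, volume_l):
    """Return the grams of each component needed for volume_l litres."""
    return {name: round(grams * volume_l, 3) for name, grams in per_litre_g.items()}

# Working volumes taken from the examples: 60 ml flasks and a 13 l fermentor.
for volume_l in (0.06, 13):
    print(volume_l, scale_recipe(E1_PER_LITRE_G, volume_l))
```

The pH adjustment to 7.4 before autoclaving is not affected by scaling and still has to be made on the final volume.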



General methods 

NMR data were collected with a JEOL JNM-GX 400 spectrometer at ambient
temperature. 1H and 13C NMR samples were internally referenced to TMS.

The anthracycline metabolites were detected by HPLC (LaChrom, Merck Hitachi, pump
L-7100, detector L-7400 and integrator D-7500) using a LiChroCART RP-18 column
(4.6 x 250 mm). Acetonitrile:potassium hydrogen phosphate buffer (60 mM, pH 3.0
adjusted with citric acid) was used as the mobile phase. A gradient running from
65% to 30% of potassium dihydrogen phosphate buffer was used to separate the
compounds. The flow rate was 1 ml/min and detection was carried out at 430 nm.

ISP4 plates supplemented with thiostrepton (50 µg/ml) were used to maintain the
plasmid-carrying cultures.

Example 1. Cloning the gene cluster for nogalamycin biosynthesis 



1.1 Cosmid library 

For the isolation of total DNA, Streptomyces nogalater (ATCC 27451) was grown for
three days in 50 ml of TSB medium supplemented with 0.5% of glycine. The cells
were harvested by centrifuging for 15 min at 3900 x g in 12 ml Falcon tubes, and the
cells were stored at -20°C. Cells from a 12 ml sample of the culture were used to
isolate the DNA. 5 ml of lysozyme solution containing 5 mg of lysozyme/ml was added
onto the cells, which were incubated for 20 min at 37°C. 500 µl of 10% SDS containing
0.7 mg of proteinase K was added onto the cells, which were incubated for 80 min at 62°C;
another 500 µl of 10% SDS containing 0.7 mg of proteinase K was added, and incubation was
continued for 60 min. The sample was chilled on ice, 600 µl of 3 M NaAc, pH 5.8,
was added, and the mixture was extracted with equilibrated phenol (Sigma). The
phases were separated by centrifuging at 1400 x g for 10 min. The DNA was precipitated
from the aqueous phase with an equal volume of isopropanol, spooled with a glass rod,
washed by dipping into 70% ethanol, air dried and dissolved in 500 µl of TE buffer.

The chromosomal DNA was partially digested with Sau3AI. The DNA fragments were
separated by agarose gel electrophoresis, and the fragments of 30 to 50 kb were cut
from the 0.3% low gelling temperature SeaPlaque® agarose. The DNA was
isolated from the gel by heating to 65°C and extracting with an equal volume of equilibrated
phenol, and the phases were separated by centrifuging for 15 min at 2500 x g. The
phenol phase was extracted with TE buffer and centrifuged, and the aqueous phases were
pooled. The DNA was precipitated by adding 0.1 volumes of NaAc, pH 5.8, and 2
volumes of ethanol at -20°C for 30 min, and centrifuged for 30 min at 15 000 rpm in a
Sorvall RC5C centrifuge using an SS-34 rotor with adapters for 10 ml tubes. The pellet
was air dried and dissolved in 20 µl of TE buffer. The isolated fragments were ligated
to pFD666 cosmid vector digested with BamHI and dephosphorylated. The DNA was
packaged into phage particles and used to infect E. coli using the Gigapack® III XL
Packaging Extract kit according to the manufacturer's instructions.

1.2 Identification of the clones by hybridization

The infected cells were grown on LB plates containing 50 µg/ml kanamycin and
transferred to Hybond™-N nylon membranes (Amersham). The membranes were
handled according to the protocol described in Boehringer Mannheim's manual "The
DIG System User's Guide for Filter Hybridization". The probe used to screen the
colonies for an expanded nogalamycin gene cluster was a 1.07 kb SacI fragment from
the cluster described earlier (Torkkell et al., 1997). The plasmid carrying the probe was
digested with SacI, and the fragment was separated from the vector by agarose gel
electrophoresis and isolated from the gel using the Qiaquick Gel Extraction Kit (Qiagen).
The probe was labelled with digoxigenin using the random primed labelling system according
to Boehringer Mannheim's manual "The DIG System User's Guide for Filter
Hybridization". 5000 colonies were screened by hybridization at 70°C using the probe
described. Positive colonies were detected using the DIG Luminescent Detection Kit
(Boehringer Mannheim). Seven colonies gave a positive signal. Cosmids from the
positive clones were isolated from a 5 ml culture by the alkaline lysis method. Restriction
analysis showed that the cloned fragments overlapped each other, representing at least 60
kb of continuous DNA. The positive clones obtained were designated as pFDSno1 to
pFDSno7.

1.3 Subcloning the fragments for sequencing

Clone No. 5, designated as pFDSno5, was digested with BglII, and for subcloning two
fragments of about 10 kb and 7 kb were isolated and ligated to pSL1190 digested with
BglII and dephosphorylated. The plasmids obtained were named pSn42 and pSn43,
respectively. These two fragments cover the DNA region flanking the previously
characterized area of the nogalamycin biosynthesis cluster. To determine the nucleotide
sequence of the whole 17 kb region cloned in pSn42 and pSn43, convenient
restriction sites were used to subclone the fragments into the vector pUC19 or pSL1190,
giving 16 subclones from the insert of pSn42 and 11 subclones from that of pSn43.

E. coli XL1 Blue MRF' cells were cultivated overnight at 37°C in 5 ml of LB
medium supplemented with 50 µg/ml of ampicillin. To isolate plasmids for sequencing
reactions, the Wizard Plus Minipreps DNA Purification System kit of Promega or the Biometra
silica spin plasmid miniprep kit of Biomedizinische Analytik GmbH was used according
to the manufacturers' instructions.

DNA sequencing was performed using the automatic ABI DNA sequencer
(Perkin-Elmer) according to the manufacturer's instructions.

1.4 Sequence analysis and the deduced functions of the genes

Sequence analyses were carried out using the GCG sequence analysis software package
(Version 8; Genetics Computer Group, Madison, Wisconsin, USA). The translation table
was modified to accept GTG as a start codon as well. Codon usage was analysed using
published data (Wright and Bibb 1992).
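
The effect of accepting GTG in addition to ATG as a start codon can be illustrated with a naive six-frame ORF scan. This is a minimal sketch only: it is not the GCG or CODONPREFERENCE algorithm, it ignores codon-usage statistics, and the function names and the toy sequence are hypothetical.

```python
STARTS = {"ATG", "GTG"}          # GTG accepted as an alternative start codon
STOPS = {"TAA", "TAG", "TGA"}

def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_orfs(seq, min_codons=2):
    """Return (start, end, strand) tuples, 1-based and inclusive on the
    forward strand, for ORFs running from the first start codon in a frame
    to the next in-frame stop codon (stop included)."""
    seq = seq.upper()
    n = len(seq)
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, n - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon in STARTS:
                    start = i
                elif start is not None and codon in STOPS:
                    if (i - start) // 3 >= min_codons:
                        if strand == "+":
                            orfs.append((start + 1, i + 3, "+"))
                        else:
                            # map coordinates back to the forward strand
                            orfs.append((n - (i + 3) + 1, n - start, "-"))
                    start = None
    return orfs

# Toy sequence with a single GTG-initiated ORF (GTG AAA CCC TAG).
print(find_orfs("CCGTGAAACCCTAGTT"))
```

With STARTS reduced to ATG alone, the toy ORF would not be reported, which is why the translation table matters for GC-rich Streptomyces DNA.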

According to the CODONPREFERENCE program, the sequenced DNA fragment (SEQ
ID NO:1) according to the invention contained 15 complete open reading frames (ORFs),
and the 5' ends of two further ORFs, one at each end of the fragment. The functions of
the genes were deduced by comparing the amino acid sequences translated from their
base sequences to the known protein sequences in the data banks. The results are shown
in Table 1. The positions given refer to the appended sequence listing. The amino acid
sequences of the peptides are given in SEQ ID NO:2 to SEQ ID NO:18.

Table 1

Gene    Position           Amino acids        Deduced function                      Remarks
                           (SEQ ID NO)
snogI   -1027 compl        >342 (2)           aminotransferase                      5' end
snogJ   1192-2073          293 (3)            dTDP-glucose synthase
snogA   2106-2822 compl    238 (4)            aminomethyl transferase
snoaM   2826-3800 compl    324 (5)            a polyketide cyclase
snogN   3799-5025          408 (6)            dnrQ homology (Otten et al.,
                                              1995), unknown
snoaG   5088-6356          422 (7)            hydroxylase
snogC   6334-7209 compl    291 (8)            dTDP-4-dehydrorhamnose reductase
snogK   7245-8297 compl    350 (9)            dTDP-glucose-4,6-dehydratase
snoaL   8537-8941          134 (10)           NAME cyclase (nogalonic acid
                                              methyl ester)
snoK    8992-9699          235 (11)           unknown
snogD   9745-10917 compl   390 (12)           glycosyl transferase
snoW    11057-11884        275 (13)           unknown
snogE   [illegible]        [illegible]* (14)  glycosyl transferase
snoL    13335-13754 compl  139 (15)           unknown
snoO    13974-14441        155 (16)           homologous to mtmX of
                                              mithramycin cluster
snoaF   14532-15377        281 (17)           C-7 ketoreductase analogous to
                                              aklaviketone ketoreductase
snoN    15450-             >190 (18)          unknown                               5' end

*: nucleotide sequence of about 100 bp, not known
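
The positions and amino acid counts in Table 1 can be cross-checked arithmetically: if each position range is assumed to include the stop codon (an assumption, not stated in the text), the peptide length is (end - start + 1) / 3 - 1. The sketch below applies this to the ORFs whose coordinates are fully given; snogI, snogE and snoN are left out because their coordinates are incomplete or illegible.

```python
# (gene, start, end, amino acids) as listed in Table 1.
TABLE_1 = [
    ("snogJ", 1192, 2073, 293),
    ("snogA", 2106, 2822, 238),
    ("snoaM", 2826, 3800, 324),
    ("snogN", 3799, 5025, 408),
    ("snoaG", 5088, 6356, 422),
    ("snogC", 6334, 7209, 291),
    ("snogK", 7245, 8297, 350),
    ("snoaL", 8537, 8941, 134),
    ("snoK", 8992, 9699, 235),
    ("snogD", 9745, 10917, 390),
    ("snoW", 11057, 11884, 275),
    ("snoL", 13335, 13754, 139),
    ("snoO", 13974, 14441, 155),
    ("snoaF", 14532, 15377, 281),
]

def residues_from_positions(start, end):
    """Peptide length for an ORF given inclusive nucleotide coordinates.
    ASSUMPTION: the range includes the stop codon."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return length // 3 - 1

for gene, start, end, listed in TABLE_1:
    computed = residues_from_positions(start, end)
    status = "ok" if computed == listed else "MISMATCH"
    print(f"{gene}: computed {computed}, listed {listed} -> {status}")
```

A mismatch for any row would point to a transcription error in either the coordinates or the residue count.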

1.5 Expression cloning

The 10 kb BglII fragment from pFDSno5 was cloned into the plasmid pIJ486 and the
plasmid obtained was named pSY42. Correspondingly, the 7 kb BglII fragment from
pFDSno5 was cloned into the plasmid pIJE486, and the plasmid pSY43 was obtained.
Plasmid pSY42 was introduced into S. lividans strain TK24 by protoplast transformation,
isolated from it and further introduced into S. galilaeus mutant H039, and after
propagation in H039, transferred to other S. galilaeus mutants blocked in the
deoxyhexose pathway for the characteristic sugars of aclacinomycins (H075, H026, and H063).
E1 medium was used for anthracycline production, and the products were extracted
from the culture with toluene:methanol (1:1) at pH 7. Anthracycline metabolites were
analyzed by HPLC. The products of the mutants H039, H026, H063 and H075 carrying
pSY42 differed from those obtained from the mutants without the plasmid.

According to the sequence analysis, pSY42 contained a cyclase designated as NAMEC
(nogalonic acid methyl ester cyclase), and in pSY43 a ketoreductase gene was
identified. Expression constructs were prepared which contained all the genes needed for
the formation of the nogalamycin aglycone. A 1.4 kb BamHI-SacI fragment from pSY42
(containing NAMEC) and a 1.1 kb MluI-KpnI fragment from pSY43 carrying the gene
for a ketoreductase of the C-7 keto group were ligated to pSY15 linearized with SacI, to
form the plasmid pSY15c (Fig. 4). Plasmid pSY15c was introduced into S. lividans
TK24, and the strain TK24/pSY15c was cultivated in E1 medium supplemented with
thiostrepton. An aglycone compound was produced, and this structure is now called
nogalamycinone.

Example 2. Compounds generated by the sno5-cluster

2.1 Production and purification of the products derived from H039/pSY42 and
TK24/pSY15c

The seed culture, 180 ml of E1 culture of the plasmid-containing strain, H039/pSY42 or
TK24/pSY15c, was obtained by cultivating the strain in three 250 ml Erlenmeyer flasks
containing 60 ml of E1 medium supplemented with thiostrepton (5 µg/ml) for four days
at 30°C, 330 rpm. The combined culture broths (180 ml) were used to inoculate 13 l of
E1 medium in a fermentor (Biostat E). Fermentation was carried out for seven days at
28°C (330 rpm, aeration: 450 l/min).

The cells were harvested by centrifuging. 2.6 l of methanol was used to break the
bacterial cells and to extract the accumulated anthracycline metabolites. The anthracycline
metabolites were extracted using 2 l of dichloromethane at pH 6. The organic layer was
evaporated to dryness. The viscous residue was flashed through a polyamide (11)
column using water:methanol from 1:9 to 0:10 as the eluent. Pooled fractions containing
the compounds were further purified on a Merck-Hitachi HPLC using a preparative
reversed phase column (LiChroCART RP-18, 5 µm) with the mobile phase acetonitrile:1%
AcOH in water (1:1). Evaporation of acetonitrile gave pure products as yellow powders,
which were dried under vacuum.



2.2 Structural elucidation of the compounds derived from H039/pSY42 and
from TK24/pSY15c

NMR analysis included NON, BMC, NOE, DEPT and HMBC techniques. Protons were
assigned using NOESY and 2D pTOCSY techniques and carbons using DEPT and
HMBC techniques.

As deduced from the data given in Tables 2 and 3, the structures revealed were
aklavinone-4'-epi-2-deoxyfucose from the culture of H039/pSY42, and
9-epi-auramycinone (=nogalamycinone) from the culture of TK24/pSY15c. The chemical
structures of the compounds are shown below in Formula I and Formula II, respectively.



[Formula I and Formula II: structural drawings not reproduced]



Deposited microorganisms 

The following microorganisms were deposited according to the Budapest Treaty at
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ),
Mascheroder Weg 1b, D-38124 Braunschweig, Germany.

Microorganism                  Accession number    Date of deposit

S. lividans TK24/pSY42
carrying the plasmid pSY42     DSM 12451           14 October 1998

S. lividans TK24/pSY43
carrying the plasmid pSY43     DSM 12452           14 October 1998

Table 2. 1H and 13C assignments of the compound aklavinone-4'-epi-2-
deoxyfucose (Formula I). Entries marked [illegible] could not be recovered
from the source.

Site    1H                               13C
1       [illegible]                      120.1
2       [illegible]                      [illegible]
3       [illegible]                      124.6
4       -                                161.9
4-OH    [illegible]                      -
4a      -                                115.4
5       -                                192.3
5a      -                                114.4
6       -                                162.4
6-OH    [illegible]                      -
6a      -                                [illegible]
7       [illegible]                      [illegible]
8A      [illegible]                      33.9
8B      [illegible]
9       -                                [illegible]
9-OH    [illegible]                      -
10      [illegible]                      [illegible]
10a     -                                [illegible]
11      [illegible]                      [illegible]
11a     -                                133.1
12      -                                [illegible]
12a     -                                [illegible]
13A     1.73, 1H, dq, 14.2, 7.4          32.0
13B     1.51, 1H, dq, 14.2, 7.4
14      1.10, 3H, t, 7.4                 6.7
15      -                                171.1
16      3.69, 3H, s                      52.5
1'      5.41, 1H, d, 3.5                 101.7
2'a     1.75, 1H, ddd, 12.8, 11.2, 3.4   37.7
2'e     2.19, 1H, dd, 12.8, 5.3
3'      3.71, 1H, ddd, 12.0, 9.0, 5.3    69.0
4'      3.14, 1H, dd, 9.1, 9.0           78.1
5'      3.88, 1H, dq, 9.1, 6.2           68.8
6'      1.36, 3H, d, 6.2                 17.6

Table 3. 1H and 13C assignments of 9-epi-auramycinone (Formula II).

Site    1H                         13C
1       7.76, 1H, dd, 7.5, 1.2     119.8
2       7.67, 1H, dd, 8.3, 7.5     137.4
3       7.28, 1H, dd, 8.3, 1.2     124.8
4       -                          162.5
4-OH    11.86, 1H, s               -
4a      -                          115.6
5       -                          192.7
5a      -                          114.6
6       -                          160.9
6-OH    12.76, 1H, s               -
6a      -                          134.1
7       5.40, 1H, t, 7.0           64.0
8A      2.66, 1H, dd, 13.9, 7.0    40.9
8B      1.89, 1H, dd, 13.9, 7.1
9       -                          70.5
9-OH    3.49, 1H, brs              -
10      3.93, 1H, d, 0.8           56.0
10a     -                          142.1
11      7.51, 1H, d, 0.8           120.1
11a     -                          133.3
12      -                          180.9
12a     -                          132.1
13      1.44, 3H, s                28.7
14      -                          173.0
15      3.90, 3H, s                52.6




References 

Aoki, M., Shirai, H., Nakayama, N., Itezono, Y., Mori, Mitsuko, Satoh, T., Ohishima,
S., Watanabe, J., and Yokose, K. 1991. Structural studies on avidinorubicin, a novel
anthracycline with platelet aggregation inhibitory activity.
J. Antibiot. 44: 635-645.

Bibb, M.J., Janssen, G.R., and Ward, J.M. 1985. Cloning and analysis of the promoter
region of the erythromycin resistance gene (ermE) of Streptomyces erythraeus. Gene
38: 215-226.

Bhuyan, B.K., and Dietz, A. 1965. Fermentation, taxonomic and biological studies of 
nogalamycin. Antimicrob. Ag. Chemother. 1965: 836-844. 

Bhuyan, B.K., and Smith C.G. 1975. In: Antineoplastic and immunosuppressive agents, 
part II, pp. 623-632. Sartorell, A.C., and Johns, D.G. (eds), Springer- Verlag, Berlin. 

Fujii, I., and Ebizuka, Y. 1997. Anthracycline biosynthesis in Streptomyces galilaeus. 
Chem. Rev. 97: 7: 2511-2523. 

Kawai, H., Hayakawa, Y., Nakagawa, M., Furihata, K., Shimazu, A., Seto, H. and
Otake, N. 1987. Arugomycin, a new anthracycline antibiotic. I. Taxonomy, fermentation,
isolation and physico-chemical properties. J. Antibiot. 19: 1266-1271.

Kunnari, T., Tuikkanen, J., Hautala, A., Hakala, J., Ylihonko, K., and Mantsala, P.
1997. Isolation and characterization of 8-demethoxy steffimycins and generation of
2,8-demethoxy steffimycins in Streptomyces steffisburgensis by the nogalamycin
biosynthesis genes. J. Antibiot. 50: 496-501.

Lomovskaya, N., Doi-Katayama, Y., Filippini, S., Nastro, C., Fonstein, L., Gallo, M.,
Colombo, A.L. and Hutchinson, C.R. 1998. The Streptomyces peucetius dpsY and dnrX
genes govern early and late steps of daunorubicin and doxorubicin biosynthesis. J.
Bacteriol. 180: 9: 2379-2386.

Madduri, K., Kennedy, J., Rivola, G., Inventi-Solari, A., Filippini, S., Zanuso, G.,
Colombo, A.L., Gewain, K.M., Occi, J.L., MacNeil, D.J., and Hutchinson, C.R. 1998.
Production of antitumor drug epirubicin (4'-epidoxorubicin) and its precursor by a
genetically engineered strain of S. peucetius. Nature Biotech. 16: 69-74.

Niemi, J., Ylihonko, K., Hakala, J., Kopio, A., Parssinen, R., and Mantsala, P. 1994. 
Hybrid anthracycline antibiotics: production of new anthracyclines by cloned genes from 
Streptomyces purpurascens in Streptomyces galilaeus. Microbiol. 140: 1351-1358. 

Otten, S.L., Liu, X., Ferguson, J., and Hutchinson C.R. 1995. Cloning and
characterization of the Streptomyces peucetius dnrQS genes encoding a daunosamine biosynthesis
enzyme and a glycosyl transferase involved in daunorubicin biosynthesis. J. Bacteriol.
177: 6688-6692.




Rajgarhia, V.B., and Strohl, W.R. 1997. Minimal Streptomyces sp. strain C5
daunorubicin polyketide biosynthesis genes required for aklanonic acid biosynthesis. J.
Bacteriol. 179: 8: 2690-2696.

Strohl, W.R., Bartel, P.L., Li, Y., Connors, N.C., and Woodman, R.H. 1991. Expression 
of polyketide biosynthesis and regulatory genes in heterologous streptomycetes. J. Ind. 
Microbiol. 7: 3: 163-174. 

Torkkell, S., Ylihonko, K., Hakala, J., Skurnik, M., and Mantsala, P. 1997.
Characterization of Streptomyces nogalater genes encoding enzymes involved in glycosylation steps
in nogalamycin biosynthesis. Mol. Gen. Genet. 256: 203-209.

Ward, J.M., Janssen, G.R., Kieser, T., Bibb, M.J., Buttner, M.J., and Bibb, M.J. 1986.
Construction and characterization of a series of multicopy promoter-probe plasmid
vectors for Streptomyces using the aminoglycoside phosphotransferase from Tn5 as
indicator. Mol. Gen. Genet. 203: 468-478.

Wiley, P.F., Kelly, R.B., Caron, E.L., Wiley, V.H., Johnson, J.H., MacKellar, F.A. and 
Mizsak, S.A. 1977. Structure of nogalamycin. J. Am. Chem. Soc. 99: 542-549. 

Wiley, P.F., Elrod, D.W., and Marshall, V.P. 1978. Biosynthesis of the anthracycline
antibiotics nogalamycin and steffimycin B. J. Org. Chem. 43: 3457-3461.

Wright, F., and Bibb, M.J. 1992. Codon usage in the G+C -rich Streptomyces genome. 
Gene 113: 55-65. 

Ylihonko, K., Hakala, J., Niemi, J., Lundell, J., and Mantsala, P. 1994. Isolation and 
characterization of aclacinomycin A-non-producing Streptomyces galilaeus (ATCC 
31615) mutants. Microbiol. 140: 1359-1365. 

Ylihonko, K., Hakala, J., and Mantsala, P. 1995. Process for producing anthracyclines 
and intermediates thereof. WO 96/10581. 

Ylihonko, K., Hakala, J., Kunnari, T., and Mantsala, P. 1996a. Production of hybrid
anthracycline antibiotics by heterologous expression of Streptomyces nogalater
nogalamycin biosynthesis genes. Microbiol. 142: 1965-1972.

Ylihonko, K., Tuikkanen, J., Jussila, S., Cong, L., and Mantsala, P. 1996b. A gene 
cluster involved in nogalamycin biosynthesis from Streptomyces nogalater: sequence 
analysis and complementation of early-block mutations in the anthracycline pathway. 
Mol. Gen. Genet. 251: 113-120. 

Ylihonko, K., Hakala, J., and Kunnari, T. 1998. Hybrid anthracyclines from genetically
engineered Streptomyces galilaeus strains. FI patent application No. 981062.

Yoshida, M., Fujioka, A., Nakano, K., Yuasa, C., Toko, T., Takeda, S., and Unemi, N.
1996. Activity of menogaril against various malignant lymphoma cell lines and a human
lymphoma xenograft in mice. Anticancer Res. 16: 2875-2879.




Claims 

1. Isolated and purified DNA fragment, which is the gene cluster for the anthracycline 
biosynthetic pathway of the bacterium Streptomyces nogalater, being included in a lOkb 

5 and a 7kb flanked Bglll fragments of S. nogalater genome. 

2. The DNA fragment according to claim 1, comprising the nucleotide sequence given 
in SEQ ID NO:l, or a sequence showing at least 80% homology to said sequence. 

10 3. A recombinant DNA, which comprises the DNA fragment according to claim 1 or 2, 
cloned in a plasmid replicating in Streptomyces. 

4. The recombinant DNA according to claim 3, which is the plasmid pSY15c, compris- 
ing a 1.4 kb BamHl-Sacl fragment from the plasmid pSY42 and a 1.1 kb Mlul-Kpnl 

15 fragment from the plasmid pSY43. 

5. Plasmid pSY42, deposited in S. lividans strain TK24/pSY42 with the deposition 
number DSM 12451. 

20 6. Plasmid pSY43, deposited in S. lividans strain TK24/pSY43 with the deposition 
number DSM 12452. 

7. A process for the production of hybrid compounds, comprising transferring the DNA 
fragment according to claim 1 or 2 into a Streptomyces host, cultivating the recombinant 

25 strain obtained, and isolating the compounds produced. 

8. The process according to claim 7, wherein the Streptomyces host is a Streptomyces 
galilaeus host. 



9. The process according to claim 8, wherein the Streptomyces galilaeus host is selected
from the strains H026, H039, H063 and H075, which are mutant strains of S. galilaeus
ATCC 31615.

10. The process according to claim 8, wherein an anthracycline is produced, which has
the following formula I




11. The process according to claim 8, wherein an anthracyclinone is produced, which
has the following formula II





12. A process for the production of hybrid compounds, comprising transferring at least
one of the genes selected from the group consisting of snogJ, snogA, snoaM, snogN,
snoaG, snogC, snogK, snoaL, snoK, snogD, snoW, snogE, snoL, snoO and snoaF into a
Streptomyces host, said genes being derived from the DNA fragment of claim 1 or 2,
cultivating the recombinant strain obtained, and isolating the compounds produced.

13. The process according to claim 12, wherein the gene snoaL encoding NAME
cyclase is transferred into a Streptomyces host.

14. The process according to claim 12, wherein at least one of the genes snogD and
snogE encoding glycosyl transferases is transferred into a Streptomyces host.

15. The process according to claim 12, wherein at least one of the genes snogJ, snogN,
snogC, snogK and snogA affecting the formation of nogalamine and nogalose is
transferred into a Streptomyces host.

SEQUENCE LISTING

<110> Galilaeus Oy 

<120> Gene cluster involved in nogalamycin biosynthesis, and its use in 
production of antibiotics 

<160> 18 

<210> 1 

<211> 16020 

<212> DNA 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<221> misc_feature 

<222> 3799..3800

<223> "overlapping sequence in the genes snoaH and snogN"

<221> misc_feature

<222> 6334..6356

<223> "overlapping sequence in the genes snoaG and snogC"

<221> misc_feature

<222> 13201..13300

<223> "unknown region" 

<400> 1 



agatctcgtc cgccagtgcc tcggtgaccg gcaacgagcc cttggcgtag ccgagatggg 60
agaaaccggt catggtgtgc acgggccagg gataactgat gttgagggcg atgtcgtagg 120
aggcgcgcag ggcctccagc accgcgtccc gtcgcggatg gcgcaccacg tacacgtagt 180
agacgtgctc gttgccctgc gcggtcctcg gcagcagcag ccccgtgtcc gccaggccct 240
cctcatagcg gcgtgccacc gcccggcggg cctcgatgta ggacggcaac cgggacagct 300
tgcgccgcag gatctctgcc tgtacttcgt ccagccggct gttgtgcccg ggggtttcga 360
cgacgtagta gcggctctcc atgccgtagt agcgcagccg ccgcagccgg tccgccaccc 420
gctcgtcgtc ggtgagcacc gcgccgccgt ccccgtacgc gcccagcacc ttggtcgggt 480
agaaggagaa cgcggccgcg tcaccggtcg agccggcgag tcggccgtgc cggcgcgccc 540
cgtgcgcctg cgcgcagtcc tccaggatca ccaggttgtg ccgggcggcc agatcgcgca 600
gcggtgccat gtccacgcac tgcccgtaga ggtggaccgg cagcagacac cgggtgcgtg 660
gcgtgaggac ggcctccacc tgggacgtgt ccatcaggta gtcctcctcg cgcacgtcca 720
cgaagacggg cgtggcaccg gccgagtcga tcgcgacgac cgtgggcgcg gcggtgttgg 780
acacggtgac gacctcgtcg ccgggcccga cacccaaggc ctgtaacccc agcttgacgg 840
cgttggtccc gttgtcgacg ccgacggcat gtccgacgcc ctggaatgag gcgaactcgg 900
actcgaagcc gcgcacgctc tcaccgagga cgagccggcc ggagcggaac accgtctcca 960
cggcatcgtg gatgtcctcg cgttccagct cgtattccgg cagatagtcc cacacgtgta 1020
cggtcatcga gcccctccgg gattctccct gcgaaaagtc accactctac gacaacgttc 1080
accactcgct ttttcctcaa cgtccgcttg agacggcccg gcctgctgtg gcccggggaa 1140
aggtgcggtc


gttatcatcg 


^ ^ /^*~f +^ 4~ 4- 

auuLoy LOtu 


r* 4- /-* a +- 1" rrrrr A 


ggttgttcag 


ggt ga aggga 


1200 


n Mm m X* 4* A ^ A 

atCaLLCuCy 


cegg y g y u ql 


yyyy ay^ayy 


ctccacccga 


eg act ctege 


ggtgt ccaag 


1260 


cagcttctcc 


ccgucgggga 


waay ^^yo y 


atctactacc 


cgctctccgt 


getgatgetg 


1320 


/"^ /~T /-* ^trt^ ~s 

yccgyCyLCa 


/-» /~t/—t a a 4~ r r +" 
vyycK»au(»o u 


patcatcaac 


acaccgcacg 


aactcccccg 


aatgcgccgt 


1380 


Ctyt uCyyCy 


a p* /~t /~t r - * n r~" <~" 
aLLjyuyuauci 


nctcooactc 

u w w y y v» w 


cgcctggcct 
8280


oaacrtatQcg acgccatggg ttccctccat ccgtcgggtg ccgtggggcg gagtgcgccc 8340
cctcgaccca gcgtcggggg cggccgtgga ggagcggttg agcttcggcg cagcggcggc 8400
tcgaccggcg gcggccggcg tcgccggact ccaacggttc tcgacggaac gaccaacggc 8460
cctggcgaga ctgcccggac agcccggccg agagagggag gacccgttga gccgtcagac 8520
agagatcgtc cgccggatgg tgagcgcctt caacaccggc aggaccgacg acgtggacga 8580
gtacatccac cccgactacc tcaatccggc caccttggaa cacggcatcc acaccgggcc 8640
caaggcgttc gcccagctgg tcggctgggt gcgggcgacg ttctccgagg aagcccgcct 8700
ggaggaggtg cggatcgagg agcgcggccc gtgggtcaag gcctacctcg tgctctacgg 8760
ccgccacgtc ggccggcttg tcggtatgcc gcccaccgac cggcgcttct ccggtgaaca 8820
ggtgcacctg atgcgcatcg tcgacgggaa gatccgcgac caccgggact ggcccgactt 8880
ccaggggacg ctgcgccagc tcggcgaccc gtggcccgac gacgagggct ggcgtccgtg 8940
accgtccctg aaaccgcacc cgacgagaca tcagaccagg aaggatggct catgccggat 9000
cccggcggcc cyoccacyyt- cgagaacctg tcgaaggagg ctgtccgctt ctaccgcgag 9060
cagggttacg tgcacatccc gcgcgtcctg tcggagacgg aggtgaccgc cttccgggcc 9120
gcctgtgagg aggtcctgga gaaggagggc cgcgagatct ccggcatcgc cctgcggctg 9180
gccggcgcgc ccctgcgggt ctacagcagc gacatcctgg tcaaggagcc caagcgcacc 9240
ctgcccaccc tggtccacga cgacgagacg ggactgccgc tgaacgagct gagtgccacg 9300
ggatcgcgct


qacqqacqta 


cccgtcgaac 


gcggctgcat 


gage tacgtg 


9360 


rrrrnfrctCCC 

W* W — ' y ^ W L» W W W 


at ct cagggc 


ccqcqaqqac 


cggcaggagc 


acatgaccag 


cttcgccgag 


9420 


ttccooaacc 

K~ C* \M \^ W W 


tcacacfacot 


qtqqcccgat 


tacccgtggc 


agccgcgcgt 


cgccgtgccc 


9480 


g tc cgcgccg 


aaqacqtcqt 


gttccaccat 


tgccgtaccg 


tccacatggc 


cgaagccaac 


9540 


accagcgact 


cggtccgcat 


ggcgcatggc 


gtcgtctaca 


tggacgcgga 


cgccacctac 


9600 


W' ^4 VJ w w *— * 


tccaaaacqq 


ccacctgtcc 


cgcctgtcgc 


cgggagatcc 


actcgaaggc 


9660 


oaoctottcc 


ccctggtcac 


qqcaqqcaca 


cggcagtgag 


gtccgccgtt 


cccggcggtc 


9720 


, V_J u w u ' 


cqqaaacqqc 


accgtcagcc 


ggccagcgcc 


acgagcttgg 


cggccgtctc 


9780 


aaccGCtcQQC 


ggcatctcgc 


tcatctcctg 


ccgcacccgc 


agggccgcct 


cccgcaaccc 


9840 


caccrtcotcc 


agcagccgtc 


ggcactgctc 


ggcacccagc 


gatcccgcct 


eggcatcgaa 


9900 


cccgatgccc 


agcccggtca 


gcacatcgcg 


gttggtgtcc 


tggtaggagc 


cgtgcgggat 


9960 


gacgcactgc 


qqqacqccqq 


cggccagggc 


cgtcagcagt 


gtgccgctgc 


ccccgtgatg 


10020 


gatgafccgcg 


tcgcacgtct 


ccagcagcgc 


gcccagcgga 


atccactcca 


ccaccggtac 


10080 


g fct cgcgggc 


agt tcaccga 


qcaqggccag 


gtcgccgccg 


cccagggtca 


gcacgaactc 


10140 


cgcgt ccacg 


tccgccactt 


cgqagaacag 


cggggccagc 


ttggcgatgc 


cgcccgacag 


10200 


cacotcaata 


qaqcccaqcq 


tcaccgcgat 


acgccgccgg 


ccggccgcgg 


gcggcagcca 


10260 


atccaocaac 


accgctccgc 


cgttgtaggg 


gacgtaccgc 


atcggccagg 


cacccgggga 


10320 


gcgccggtcc 


tccggcagca 


gcgcctccac 


gctcggcggt 


gtcgtcgtca 


gccgcacgga 


10380 


accootcoac 


tcaccqqtqa 


cgccgtggcg 


ctcgtagtcc 


ttggacatcg 


cccgccggat 


10440 


aaacQCQCCQ 


agccccggct 


cgctgtccgc 


gggacccagc 


ggcagctcta 


cgcacggcag 


10500 


ttocaococt 


gccgccgtca 


qcqqqcccqc 


gccctgtgtc 


ggagtgtgca 


cgacgaggtc 


10560 


oaaccoccaa 


ctccgcgccg 


tccgcagcgc 


cccgtcgacg 


gccaccgccg 


atacccgggc 


10620 


gaacatctcg 


qcqaaqaaqc 


cctcgcccag 


cccctcggag 


tgcatcgggt 


cggtgacgtc 


10680 




ggcacgaaca 


gcttcgcgta 


gttcacgccg 


ggcgacacgt 


ccacggcgca 


10740 


cagcccggcc 


tccgcgacgg 


cgcggatgtc 


gccccccgtg 


gcgtagcgga 


cctcgtggcc 


10800 


qagaqcgcgc 


agcgcctgtg 


ccagcggcac 


cgtcggcagg 


atgtggctga 


gcccgggtga 


10860 


agtgatgaac 


aacgcacgca 


tgatgccccc 


tgttcgacat 


gaacctggaa 


cacgcatcct 


10920 


oacoQcqcct 


tctgttgctc 


cggtcgacgc 


ccggtcgaca 


ggccctcgta 


cagcccgccg 


10980 




cggccacgac 


gcaggctcca 


gcggacgtcg 


acggcgggga 


cgcagcgtgg 


11040 


tcgccgggag 


gcatcgatga 


o #"T"+~ ^ 4- +- nrt+" 

c ciyuciL. uy y u. 


a. o. LU^^ai^LL> 


GL w u — 1 y ciaa w w; 


tcqqccqaca 

wwwj^wwv^^jwu 


11100 


cgtcgtcacc 


gggctactgg 


ccgccggccg 


ccgggtgcgg 


gcgctgaccc 


gcacacccga 


11160 


ccggtccggc 


ctgcccggcg 


gcgcggagat 


cacaggcggc 


gacctgaccc 


gcccggagac 


11220 


ctacgagcgg 


atgctggacg 


gtgtcgaagc 


cgtctacctg 


ttccccgtcc 


cggagaccgc 


11280 


cgcggcgttc 


gccggggccg 


cgcgacgggc 


cggtgtccgg 


cggatcgtgg 


tgctctcctc 


11340 
ggactccgtc


aCCyaCyyCd 


uuyciL' ci^uy y 


aggacaccgg 


cqcatqqaac 


tggccgtgga 

T3j 3 3 3 


11400 


ggacacgggg 


CtCyayLyya 


LULaty tyuy 


ccccggcgag 


ttcgcgctca 


acaaggtcac 


11460 


cctgtgggcg 


wuy L^ya i>w v 




catcatccoa 


tccgcgtatc 


cggacgcccg 


11520 


ggtggccccg 


gtgcacgagg 


/-» /— < "a /-» /-t 4r /~« /"X /~« 

Luyauy Luy l 


y y u y i» w y w \j 


gtgaccgcgc 


tqctqaaqqa 


11580 


ggggcacgcc 


ggccgcgcct 


ci.u-ciyuyi.ycLU 


y y u w \*r y v<> cy 


gccctcaccc 


agegegaaca 


11640 


ggtccgcgcg 


gtaggggagg 


yyLLLyyLuy 


Im- V^r ^ ^ \p« 


ttcgtcgagg 


tgacccccgg 


11700 


gcaggcgcgg 


gccgacctga 


ccyct- L- ayyy 


yin uyuuuyuy 


CCCS.^.CQCCQ 

W \* *A w w %j w- 


actacgtcct 


11760 


cgccttccaa 


gccgggtgga 


uu>MCLyu>yyuu 


cgcccccgcc 


caaccaaccq 


tgcgggaggt 


11820 


caccggccgg 


CCCyCCCyCa 


Lvyuo^o 


citoocrccacc 

w w w v w 


gaccaccgag 


cggacttccg 


11880 


gtgaccggag 


accgcgtcca 


LLy uy L.uav^y 


L-L C_L au u v^* 


gacg cc eg tg 


cgcgtactgc 


11940 


tgacgtcctt 


cgccatggac 


rt^/^^iS f*"' -f- /— • 4- 

yt^ULawULw 


nrs eerie* cnl" 

y u a u u y w« y i- 


gccgctggcg 


tqqacactqc 


12000 


ggtcggccgg 


gcacgaggt a 




y u ciy w ^w> ^ ^ - 


gctcacctcc 


accatcacgg 


12060 


gagccggcct 


gaccgccgtg 


uuyy LL>yy 


ocoaccacac 


gcacggcagc 


ctcctgggcc 


12120 


gggtcggcag 


cgacat cctc 


n f t~*^ rrp a ffr 
yLLO uyuduy 


a c cia crcr ca a a 
clu y ^^y y y u 


ctacctaaaa 


gcccgtcacg 


12180 


acgccctggg 


+" /*T .T3 /-T 4- +- I**" 

Cuut^ay un# 


u u w c&cmy u y w 


acaacacggt 


oatqtccqcq 


ttgttctact 


12240 


cgcagatcaa 


CaaCgdt-tL-y 


A"t~ not ccraco 


acctggtgga 


cttcgcccgt 


cactggcggc 

3~ 33 


12300 


ccgacc ugg u 


u y uu ^»yyy a y 


rcnttcacct 


tcacoaacoc 


cqtqqccgcg 


cgggcctcgg 


12360 


gcgccgccca 


f~t f** ^ 

Cycccyccuy 


ci~ rrt"ccttcc 

u u y uu w i> i<u w 


ccgacctgtt 


cctcagcacg 


cgccgcctct 


12420 


^ <r /"*r a ^ s*r 

uCCtyyaycy 


a +- rr/~r r"»r"T rTT ("* 


caoaao ccca 

CL^H *J w w w 


agcaticacga 


cgacacactc 


gecgaatgge 


12480 


tcgactggac 


CCtuyyCCyy 


u Q w y y v u w ^> 


cctt cgacga 


aoaaat cote 


acqqqgcagt 


12540 


ggt ccatcga 


C C aQatt u U U 


yuuuuuy t-yi* 


ggctcgacgc 


caacaotccc 


accgtgccga 


12600 


tgcggtacgt 


CCCCtaCayC 


yyaL uyy uyi* 


ccacaatocit 


gcccgactgg 


ctqcqcaqqc 


12660 


cgcccgagcg 


yCCacyyy uc 


L> Lyy uuauuu 


tcaocatcac 


ctcaccracaa 


gtgaagtcct 


12720 


t cct cgccgt 


CuCCytyyaC 


y at. w u l l uuy 


aancccitaac 

ay y ^ ^ w 


eggget egge 


qtcqaqqtqq 


12780 


tcgccaccct 


Cyaw^CCyoC 


U ay 33 3 3 


t gctggggcg 


cataccqaac 


cacttccgca 


12840 


ucgu.cgo.gca 




oacocccrttc 

V-J ^1 *ih- ^ Ih» w 


tgccgacctg 


ctcggcgatc 


gtccaccacg 


12900 


gcggagccgg 


CaCCuyytuy 


acooccncca 

V»4 U w N--* ^-j 


tgt acggggt 


qc cQcacratc 

13 3 3 13 v 


tccctgggct 


12960 


cgatgtggga 


ccacttctac 


cgggcccguc 


y uuuyyayyci 


Q w y y yyv« y 


gggctgegge 


13020 


tCfCCCtCCCfQ 


cgagc ugacu 


yuuyayyyyu 


tncocaccco 

^ 4 *>^J t_4 ^_H> Vp^ ''-J 


actaoaaaaa 

y u. t-y y ay uyy 


atactcqacq 

T3 w T3 w *" w 313 > *3 


13080 


agccctcctt 


cggcaccgcc 


gcgcaggcgc 


tgagcgacac 


catcgcggcg 


^ A ^ f*l A A 

yaacccaycu 


J-O X ** vJ 


ccagcgaggt cgtgccggtc ctggaggagc tgaccggacg gcaccgtccc ggcacccggg 13200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ccgtccgggc ccctcgccgg 13320
tgagggagcc cggatcacag tccgtccggc accacgccca ggtcccggaa cagcggggag 13380
aagttgaaga


cgucccciy uy 


t w civ y cx \^ 


ttgeeggett 


eggagaageg 


cagctcctcc 


13440 


aagtaggt cc 


agcggacctt 


yuyyttyy y 


craacicaat ac 

u V^J U ^ K-^ W \M V-h" 


ccatgaacac 


gece tgg tgc 


13500 


yLygccgayc 


a. y y ty a. Luuy 


raacatcaca 

w u. y ^ u l# w a w u 


ccrcrt caccct 

V**> W w w w 


cgcccacgat 


gctccgcacg 


13560 


■A— A A M *^ ~3 

LCCaQaCgaa 


gg uccy gg cia, 


y y w 1 1 w cl w 


acactattca 

^4 W w U. 


tacgc eg t a c 


gacctcctcg 


13620 


gcgCwCaCwy 


gutty utt u t 


otcotcotao 


tggacgacgt 


caacftaccca 


qtqcgcaacc 

Wj w^w^w^mww 


13680 


accccggaga 


cgtcccciccy 


y u> LivwaLy ty 


nrcaccatct 


ccaggcagcg 


t tccttgttc 


13740 


gcggt cgttg 


aCatytcgaL 


Ltt uuy aayy 


f (™* rt fi ci a c t a 


ctggtcacgc 


gccagccttc 


13800 


caacccgccc 


eggaaaageg 


y Ly LrciLy aL<L> 


npt"p;p;anPPC 
yt uy y ay u ^ ^ 


KM %^ f—L W W 


ctgcgcggcg 


13860 


gagcugaacg 


yyy LLL^yay 


rrattcacca 

O U ^wr l» W ^1 W W 


aggacctgcc 


gcag cc tgtt 


acggcacacc 


13920 


ctgacgcctc 


gctccgcgcg 


t^msi <** n c ei c* c 
yyaL-yuytL-L. 


y wt>yy yay y a 


a.acia.ca.ca>cc 


accatgtegg 


13980 


tacgcaccga 


t cagaeggeg 


yuaLL>yyaay 




naccacaoat 

Lwr \^ CL W V_* Vrf CL V 


cccggg t teg 


14040 


ggcacctgta 


cgcgcaggtg 


^ a ^* a /t 4™ 4™ /^+~ 

C aytdy u u 1 1 


dw>yLLuyyua 


y ci i>» y u ay^< ^— w 


ctcoactc ccs 


14100 


gcgcggccga 




^tr , r , ar , f , 4r+'/~ , s 
y LtatL ttta 


f^na rrrra r^nn 

i.* w y u y y a w y y 


cacattcocc 


cggccctcct 


14160 


cgccggaacc 


ggcacgcggc 


udtyttydciL 


4- rtrrppftpprfn 


w y \m* ^*y ^*y 


gccgccgaac 


14220 


gcct cgccgc 


egagggecuu 


4- nrtrarprinp 
LuytdtLyyt 


a n*f~ r* a +■ c fi rr 

CL U* y I** ^> CL i> w y ^ 


catcraccaco 


otacaccaaa 


14280 


aacccgacgg 


j> f-y <^r 4- /~f 4- 4" 

cayuy uy ttL' 


y uovrUu *y * 


acgcccaggt 


cttcgccacc 


CQCCqcqqqq 


14340 


aagct ccccg 


gctgcaLLLy 


a4* pf" n n4r /*-■ 4- 


nrrfadfiacot 


octcotacao 

U \+ W v-4 ^ \t4 ^4 


or a a a a cr c c q a 


14400 


ggctgaaggt. 


yoygyctciL^yy 


rt4~ 4- rt4r parnr 
y Luy ttatryt 


acoaccrcata 

^1^^ V-j \^ VH «4 v ^4 




acccgccggc 


14460 


cgagccgcac 


l_> Ly L taL t 


w 1 1> ^ w y y w w-* 


occaoccocic 

^4 ^p* w V*J WJ V-^ 


gtcgagtccg 


ctacqaaaQq 


14520 


n r*fi a +* s 
yCgCaCLLdy 


uy Lytyay ul> 


a^nactaact 

d ^ U uw ^ ^4 w 


cgacaggtcc 


ccgcccggtg 


cccgcca tgt 


14580 


f-» r+r+ry *~* /~* r+ 
UaLLLyLLyVyL 


uay ut>L>ua^y 


ccttcccc eg 


gccccgcccc 


caciaaqcqaa 


cccgcgccgc 


14640 


ucgccgLgau 


cy utautyyt 


rrrt rrrpT^ 4" en a 
yy uyy u l tyy 


atatcoacca 


ggccaccgcc 


cgcgccttcg 


14700 


»»» ^ 4" a /""t PI /"T 

i_ y i_ l. cayyy 


4- nprra a curt - n 
uy t> y ctay y l. y 


ctcatcatco 


gccgtaccga 


aaacacQctc 


ccqcaqaccq 


14760 


ccgagggcug 


uy Lyy atauy 


ty y i»y t L.ty 


tcoccoaccft 

^. w y w w y » w y w 


ggect cgccc 


qacqqqccqc 


14820 


ciy y w y y uwy u 




f"!" ocaoaacft 
o »y wyy y ^y *■* 


t egggaggat 


egaegt cc tg 


gtcaacaacg 


14880 


Cugccg uyyc 


ggguduyyay 




*~» pnt" pfiAPca 

s— w y uvm aw w \g 


ggacgccgtg 


ocacaacaat 


14940 


LCggCaCCaa 


LCtyacyytt 




tcotccaotc 

c« w y w w uy w» 


cgcactcggc 


qcqctqqaqa 
y^y^ ^yy y " 


15000 


ay ucycycgg 


4t /*t4~ o 
tdLuy Lty i— 1_ 


duty Lyyyy a 


ccgccgcgac 


cctgggcctg 


cgcgccgccc 


15060 


t_-ya.ti~-yyt-.yi— 


y u> uy Latyyy 




tggccctcga 


ctacctgacc 


cqqacc tgqq 


15120 


ccgtcgaact ggccccccgg ggcatccgtg tcgtcggcgt ggcacccggg gtgatcgaca 15180
cgggcatcgg cgtccgcatg ggcatgaccc cggagggcta ccgggagttc ctgaccggca 15240
tgggcggcag ggtgcccgtg ggccgggtcg gccgtccgga ggacgtggcc tggtggatcg 15300
tccagctcgc ccgcccggag gccggctacg cgacgggcat ggtcgtcccc gtcgacggcg 15360
ggctgtcgct ggtctgaccg gacaaggaag gaaataccgc aggaaggaag taccgcagca 15420
aggaaatacc


gcaggaagga 


gatatcgccg 


^ /™» /-w ^ q ^ ^» 

tycaggaaac 


f* ft ^ « /-» « /-» 

Cy daCCCyyC 


y l u u u o y l. y y 


J. J40U 


acctgcccgc 


cgagagcgac 


cctgccgccc 


4~ ft ft a ft f*ft ^ 
L_yyciyi~yi— O L. 


<_yL.wyi'CH'yy 


+- a f» f~> (-» f-r f< ft ft n 
tciu^yy ^yyy 


15540 


acggctacgt 


ccacgtcccc 


ggcgtcct eg 


— ^ i /»* ^ ft ft ft ft *~i 

ciU-y(-:i~yyyycl 


y y L.l~yL.L*ydCl. 


UAwL uyy wL>y 


15600 


aggcccgtcg 


gctcctcgcc 


cacgaggagt 




ftft ft ft~ f*f*ftftf* 
y y y i_ LL-L.yyu 


ftf> f* ft ft f* a f r* ft 

UL^yy uaLLy 


15660 


tcatggacta 


cgtcgccgac 


ft f* f* f a ft r** ft 


V-J k_* o.y ^ y ci w a i_* 


y cl luuUw v>« y w 


cttoccaccc 


15720 


acccgcgcat 


a* ^\ ^+ 

cgccycccLC 


y L~ y cty LQUV^ 




occcctaaoo 
y ^ w ^ ***y 3 


ctattcaaac 


15780 


tggaggtgct gctcaaggag aacaaggaga aggacgcctc ggtccccacc gccccgcacc 15840
acgatgcgtt cgccttcccg ttctccaccg ccggcaccgc cctgacggcg tgggtcgcgc 15900
tggtcgacgt cccggtggaa cgcggctgca tgaccttcgt ccccggatca cacctgctgc 15960
cggatcccga taccggcgac gagccgtggg ccggggcctt cacccggccg ggagagatct 16020


<210> 2 

<211> 342 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snogI, function: aminotransferase"

<400> 2















Met Thr Val His Val Trp Asp Tyr Leu Pro Glu Tyr Glu Leu Glu Arg
1 5 10 15

Glu Asp Ile His Asp Ala Val Glu Thr Val Phe Arg Ser Gly Arg Leu
20 25 30

Val Leu Gly Glu Ser Val Arg Gly Phe Glu Ser Glu Phe Ala Ser Phe
35 40 45

Gln Gly Val Gly His Ala Val Gly Val Asp Asn Gly Thr Asn Ala Val
50 55 60

Lys Leu Gly Leu Gln Ala Leu Gly Val Gly Pro Gly Asp Glu Val Val
65 70 75 80

Thr Val Ser Asn Thr Ala Ala Pro Thr Val Val Ala Ile Asp Ser Ala
85 90 95

Gly Ala Thr Pro Val Phe Val Asp Val Arg Glu Glu Asp Tyr Leu Met
100 105 110

Asp Thr Ser Gln Val Glu Ala Val Leu Thr Pro Arg Thr Arg Cys Leu
115 120 125

Leu Pro Val His Leu Tyr Gly Gln Cys Val Asp Met Ala Pro Leu Arg
130 135 140

Asp Leu Ala Ala Arg His Asn Leu Val Ile Leu Glu Asp Cys Ala Gln
145 150 155 160

Ala His Gly Ala Arg Arg His Gly Arg Leu Ala Gly Ser Thr Gly Asp
165 170 175

Ala Ala Ala Phe Ser Phe Tyr Pro Thr Lys Val Leu Gly Ala Tyr Gly
180 185 190

Asp Gly Gly Ala Val Leu Thr Asp Asp Glu Arg Val Ala Asp Arg Leu
195 200 205

Arg Arg Leu Arg Tyr Tyr Gly Met Glu Ser Arg Tyr Tyr Val Val Glu
210 215 220

Thr Pro Gly His Asn Ser Arg Leu Asp Glu Val Gln Ala Glu Ile Leu
225 230 235 240

Arg Arg Lys Leu Ser Arg Leu Pro Ser Tyr Ile Glu Ala Arg Arg Ala
245 250 255

Val Ala Arg Arg Tyr Glu Glu Gly Leu Ala Asp Thr Gly Leu Leu Leu
260 265 270

Pro Arg Thr Ala Gln Gly Asn Glu His Val Tyr Tyr Val Tyr Val Val
275 280 285

Arg His Pro Arg Arg Asp Ala Val Leu Glu Ala Leu Arg Ala Ser Tyr
290 295 300

Asp Ile Ala Leu Asn Ile Ser Tyr Pro Trp Pro Val His Thr Met Thr
305 310 315 320

Gly Phe Ser His Leu Gly Tyr Ala Lys Gly Ser Leu Pro Val Thr Glu
325 330 335

Ala Leu Ala Asp Glu Ile
340

<210> 3 
<211> 293 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snogJ, function: dTDP-glucose synthase"

<400> 3



Val Lys Gly Ile Ile Leu Ala Gly Gly Thr Gly Ser Arg Leu His Pro
1 5 10 15

Thr Thr Leu Ala Val Ser Lys Gln Leu Leu Pro Val Gly Asp Lys Pro
20 25 30

Met Ile Tyr Tyr Pro Leu Ser Val Leu Met Leu Ala Gly Val Thr Asp
35 40 45

Ile Leu Ile Ile Ser Thr Pro His Glu Leu Pro Arg Met Arg Arg Leu
50 55 60

Phe Gly Asp Gly Ala Gln Leu Gly Leu Arg Leu Ala Tyr Ala Glu Gln
65 70 75 80

Glu Lys Pro Arg Gly Ile Ala Glu Ala Phe Leu Ile Gly Ala Asp His
85 90 95

Val Gly Ser Asp Ala Val Ala Leu Ala Leu Gly Asp Asn Ile Phe His
100 105 110

Gly Ser Ser Phe Gln Gly Val Leu Arg Lys Glu Ala Glu Glu Leu Asp
115 120 125

Gly Cys Val Leu Phe Gly Tyr Pro Val Lys Asp Pro Gln Arg Tyr Gly
130 135 140

Val Gly Glu Ala Asn Ala Ser Gly Arg Leu Val Ser Ile Glu Glu Lys
145 150 155 160

Pro Val Arg Pro Arg Ser Asn Arg Ala Ile Thr Gly Leu Tyr Phe Tyr
165 170 175

Asp Asn Glu Val Val Asp Ile Ala Arg Arg Leu Arg Pro Ser Ala Arg
180 185 190

Gly Glu Leu Glu Ile Thr Asp Ile Asn Arg Thr Tyr Met Glu Arg Gly
195 200 205

Arg Ala Arg Leu Val Asp Leu Gly Arg Gly Phe Ala Trp Leu Asp Thr
210 215 220

Gly Thr Pro Glu Ser Leu Leu Gln Ala Ser Gln Tyr Val Ser Ala Leu
225 230 235 240

Glu Glu Arg Gln Gly Ile Arg Ile Ala Cys Ile Glu Glu Val Ala Leu
245 250 255

Arg Met Gly Phe Ile Asn Ala Gln Ala Cys Tyr Glu Leu Gly Ala Arg
260 265 270

Leu Ser Gly Ser Gly Tyr Gly Gln Tyr Val Met Ala Ile Ala Glu Glu
275 280 285

Cys Thr Gly Arg Val
290

<210> 4 

<211> 238 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snogA, function: aminomethyl transferase"

<400> 4



Val Tyr Gly Arg Glu Leu Ala Asp Val Tyr Glu Met Val Tyr Arg Ser
1 5 10 15

Arg Gly Lys Ser Trp Ala Asp Glu Ala Glu Arg Val Thr Ala Glu Ile
20 25 30

Arg Ser Arg Arg Pro Gly Ala Arg Ser Leu Leu Asp Val Ala Cys Gly
35 40 45

Thr Gly Ala His Leu Glu Ala Phe Arg Gly Leu Phe Ala His Thr Glu
50 55 60

Gly Leu Glu Leu Ser Asp Glu Met Arg Ala Leu Ala Glu Arg Arg Leu
65 70 75 80

Pro Gly Val Pro Val Arg Pro Gly Asp Met Arg Asp Phe Ala Leu Ser
85 90 95

Gly Arg Phe Asp Ala Val Val Cys Leu Phe Cys Ser Ile Gly Tyr Leu
100 105 110

Glu Thr Val Ala Asp Met Arg Ala Ala Val Arg Thr Met Ala Ala His
115 120 125

Leu Val Pro Gly Gly Val Leu Val Val Glu Pro Trp Trp Phe Pro Glu
130 135 140

Arg Phe Leu Glu Gly Tyr Val Ala Gly Asp Leu Ala Arg Gly Glu Gly
145 150 155 160

Arg Thr Val Ala Arg Val Ser His Ser Thr Arg Gln Gly Arg Arg Thr
165 170 175

Arg Met Glu Val Arg Phe Leu Val Gly Glu Ala Thr Gly Ile Arg Glu
180 185 190

Phe Thr Glu Ile Asp Leu Leu Thr Leu Phe Thr Arg Glu Glu Tyr Leu
195 200 205

Ala Ala Phe Glu Asp Ala Gly Cys Pro Ala Glu Phe Leu Asp Asp Gly
210 215 220

Leu Thr Gly Arg Gly Leu Phe Val Gly Val Arg Gly Ala Gly
225 230 235

<210> 5 

<211> 324 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snoaM, function: polyketide cyclase"

<400> 5 

Met Thr Ala Ala Trp Gly Ala Pro Leu Tyr Pro Pro Trp Ile Pro Ala
1 5 10 15

Arg Pro Gly Arg Arg Arg Cys Gly Ala Gly Arg Arg Val Arg Cys Pro
20 25 30

Pro Val Glu Pro Ala Ser Arg Pro Arg Gln Glu Gly Arg Val Ser Val
35 40 45

Val Pro Ala Leu Arg Gln Pro Ser Pro Ser Thr Asn Pro Glu Val Arg
50 55 60

Val Arg Leu Ile Asp Leu Ser Ser Pro Val Asp Ser Ser Gln Tyr Glu
65 70 75 80

Pro Asp Pro Val Val His Asp Val Leu Thr Pro Arg Gln Gly Ala Glu
85 90 95

His Met Cys Ala Glu Met Arg Glu His Phe Gly Val Glu Phe Ser Pro
100 105 110

Asp Glu Leu Pro Asp Gly Glu Phe Leu Ser Leu Asp Arg Ile Thr Leu
115 120 125

Thr Thr His Thr Gly Thr His Val Asp Ala Pro Ser His Tyr Gly Ser
130 135 140

Arg Ala Leu Tyr Gly Asp Gly Val Pro Arg His Ile Asp Gln Met Pro
145 150 155 160

Leu Glu Trp Phe Phe Gly Arg Gly Val Val Leu Asp Leu Thr Asp Ala
165 170 175

Pro Thr Gly Thr Val Ser Ala Ala Arg Leu Glu Lys Glu Leu Ala Arg
180 185 190

Thr Gly Cys Ala Leu Arg Pro Gly Asp Ile Val Leu Leu His Thr Gly
195 200 205

Ala Gln Arg His Ala Gly Thr Pro Arg Tyr Phe Thr Asp Phe Ala Gly
210 215 220

Leu Asp Gly Pro Ala Val Arg Met Leu Leu Asp His Gly Val Arg Val
225 230 235 240

Ile Gly Thr Asp Ala Phe Ser Leu Asp Ala Pro Phe Gly His Ile Ile
245 250 255

Asp Arg Tyr Arg Ala Thr Gly Asp Arg Ser Val Leu Trp Pro Ala His
260 265 270

Val Val Gly Arg Glu Arg Glu Tyr Cys Gln Ile Glu Arg Leu Ala Asn
275 280 285

Leu Asp Arg Leu Pro Val Ser Phe Gly Phe Arg Val Cys Cys Phe Pro
290 295 300

Val Lys Val Ala Gly Ala Gly Ala Gly Trp Thr Arg Ala Val Ala Leu
305 310 315 320

Val Asp Glu Asp
























<210> 6

<211> 408

<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snogN, function: unknown"

<400> 6






























Met Val Met Lys Leu Thr Asp Ser Glu Leu Gly Arg Ala Leu Leu Ser
1 5 10 15

Leu Arg Gly Tyr Gln Trp Leu Arg Gly Ile His His Asp Pro Tyr Ala
20 25 30

Leu Leu Leu Arg Ala Glu Ser Asp Asp Pro Ala Gln Leu Gly Arg Leu
35 40 45

Leu Arg Glu Arg Gly Arg Leu His Arg Ser Asp Thr Gly Thr Trp Val
50 55 60

Thr Ala Asp His Ala Thr Ala Ser Arg Leu Leu Ala Asp Pro Arg Phe
65 70 75 80

Val Leu Arg Arg Pro Pro Ala Gly Pro Ala Thr Gly Thr Gly Asp Val
85 90 95

Met Pro Trp Glu Glu Ala Thr Leu Ser Asp Leu Leu Pro Leu Asp Glu
100 105 110

Ala Arg Leu Thr Thr Asp Arg Ala Arg Cys Arg Arg Leu Gly Ala Thr
115 120 125

Ala Ala Arg Ile Ala Ala Asp Gly Pro Val Ala Thr Arg Leu Ala Asp
130 135 140

Leu Ala Gly Ala Arg Ala Glu Gln Val Arg Ser Thr Gly His Phe Asp
145 150 155 160

Leu Arg Ala Asp Tyr Ala Leu Pro Tyr Ala Val Glu Pro Ala Cys Ala
165 170 175

Leu Leu Gly Leu Pro Ala Gly Gln Cys Ser Leu Phe Gly Ala Phe Ser
180 185 190

Pro Ala Val Leu Leu Asp Ala Thr Val Val Pro Pro Arg Leu Pro Glu
195 200 205

Ala Arg Ala Leu Ile Ala Ser Thr Ala Glu Leu Thr Ala Leu Trp Pro
210 215 220

Arg Leu Ala Pro Ser Leu Ser Lys Thr Val Pro Glu Asp Glu Ala Pro
225 230 235 240

Asp Leu Phe Leu Leu Thr Ala Val Leu Leu Val Pro Ala Val Val His
245 250 255

Leu Val Cys Glu Ala Val Ala Ala Leu Ser His Asp Pro Gly Gln Ala
260 265 270

Gly Leu Leu Arg Asp Asp Pro Val Leu Ala Ala Pro Ala Val Glu Glu
275 280 285

Thr Leu Arg His Ala Pro Pro Ala Arg Leu Phe Thr Leu His Ala Thr
290 295 300

Gly Pro Glu Arg Val Ala Asp Val Asp Leu Pro Ala Gly Ala Glu Val
305 310 315 320

Ala Val Val Val Ala Ala Ala His Arg Asp Pro Ser Trp Cys Pro Asp
325 330 335

Pro Asp Arg Phe Asp Leu Thr Arg Asn Glu Arg His Leu Ala Leu Pro
340 345 350

Pro Asp Leu Pro Leu Gly Ala Leu Ala Pro Leu Leu Arg Val Cys Ala
355 360 365

Thr Ala Ala Val Ala Ala Leu Ala Ala Gly Leu Leu Pro Leu Arg Ala
370 375 380

Val Gly Pro Pro Val Arg Arg Leu Arg Ala Pro Val Thr Arg Ser Val
385 390 395 400

Leu Arg Phe Pro Val Ala Pro Cys
405

<210> 7 
<211> 422 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snoaG, function: hydroxylase"

<400> 7



Met Asp Asn Arg Glu Thr Val Arg Pro Val Ser Val Cys Arg Val Cys
1 5 10 15

Gly Gly Asn Asp Trp Gln Asp Val Val Asp Phe Gly Asp Val Pro Leu
20 25 30

Ala Asn Gly Phe Leu Ser Pro Ala Asp Ser Tyr Glu Asn Glu Arg Arg
35 40 45

Tyr Pro Leu Gly Val Leu Ser Cys Arg Ala Cys Arg Leu Met Ser Leu
50 55 60

Thr His Val Val Asp Pro Glu Val Leu Tyr Arg Asp Tyr Ala Tyr Thr
65 70 75 80

Thr Pro Asp Ser Glu Met Ile Thr Gln His Met Arg His Ile Thr Ala
85 90 95

Leu Cys Arg Thr Arg Phe Glu Leu Pro Pro Asp Ser Leu Val Val Glu
100 105 110

Leu Gly Ser Asn Thr Gly Arg Gln Leu Met Ala Phe Arg Glu Ala Gly
115 120 125

Met Arg Thr Leu Gly Val Asp Pro Ala Arg Asn Leu Thr Asp Val Ala
130 135 140

Arg Arg Asn Gly Ile Glu Thr Phe Pro Asp Phe Phe Ser His Asp Val
145 150 155 160

Ala Arg Thr Ile Arg Arg Asp His Gly Gln Ala Arg Leu Val Leu Gly
165 170 175

Arg His Val Phe Ala His Ile Asp Asp Val Ser Asp Ile Ala Ala Gly
180 185 190

Val Arg Glu Leu Leu Ser Pro Asp Gly Val Phe Ala Ile Glu Val Pro
195 200 205

Tyr Val Leu Asp Leu Leu Glu Lys Val Ala Phe Asp Thr Ile Tyr His
210 215 220

Glu His Leu Ser Tyr Phe Thr Met Arg Ser Phe Val Thr Leu Phe Ala
225 230 235 240

Arg His Gly Leu Arg Val Leu Asp Val Glu Arg Phe Gly Val His Gly
245 250 255

Gly Ser Val Leu Val Phe Val Gly His Glu Asp Gly Pro Trp Pro Glu
260 265 270

Arg Pro Ser Val Pro Glu Leu Leu Arg Val Glu Arg Gln Arg Gly Leu
275 280 285

Tyr Asp Asp Ala Thr Tyr Arg Thr Phe Ala Gln Arg Ile Glu Arg Val
290 295 300

Arg Thr Glu Leu Pro Glu Leu Leu Arg Ser Leu Val Ala Gln Gly Lys
305 310 315 320

Arg Ile Val Gly Tyr Gly Ala Pro Ala Lys Gly Asn Thr Ile Leu Thr
325 330 335

Val Cys Gly Leu Gly Leu Lys Glu Leu Glu Tyr Cys Thr Asp Thr Thr
340 345 350

Glu Leu Lys Gln Gly Arg Val Leu Pro Gly Thr His Ile Pro Val His
355 360 365

Ala Pro Glu His Ala Lys Glu His Ile Pro Asp Tyr Tyr Leu Leu Leu
370 375 380

Ala Trp Asn Tyr Ala Thr Glu Ile Leu Asp Lys Glu Thr Ala Phe Arg
385 390 395 400

Asp Asn Gly Gly Arg Phe Ile Val Pro Ile Pro Arg Pro Ser Ile Leu
405 410 415

Thr Ser Pro Ser Gly Ser
420

<210> 8 
<211> 291 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snogC, function: dTDP-4-dehydrorhamnose reductase"

<400> 8 

Met Leu Ala Arg His Leu Thr Ala Ala Leu Ala Glu Thr Gly Arg Ser 
15 10 15 

Arg Pro Ala Ala Glu Ala Val Val Leu Gly Arg Arg Ala Leu Asp lie 
20 25 30 

Thr Asp Gly Arg Ala Val Asp Ala Ala Phe Ala Ala His Arg Pro Arg 
35 40 45 

Val Val Val Asn Cys Ala Ala Phe Thr Asp Val Asp Gly Ala Glu Ser 
50 55 60 

Arg Trp Ala Glu Ala Met Arg Val Asn Gly Gly Gly Pro Arg Leu Leu 
65 70 75 80 

Ala Arg Arg Cys Ala Arg His Gly Val Arg Leu lie His Val Ser Thr 
85 90 95 

Asp Tyr Val Phe Pro Gly Asp Thr Arg Ser Pro Tyr Gly Glu Ser Asp 
100 105 110 

Ala Pro Gly Pro Arg Thr Val Tyr Gly Arg Ser Lys Leu Ala Gly Glu 
115 120 125 

Arg Ala Val Leu Ser Leu Leu Pro Asp Thr Gly Thr Val Val Arg Thr 
130 135 140 

Ala Trp Leu Tyr Gly Gly Gln Gly Arg Ser Phe Val Arg Thr Met Leu
145 150 155 160

Glu Arg Ala Pro Asp Asp Gly His Val Asp Val Val Asn Asp Gln Trp
165 170 175

Gly Gln Pro Thr Trp Ala Gly Asp Val Ala Arg Leu Leu Val Thr Leu
180 185 190

Ala Arg Thr Pro Pro Asp Arg Ala Arg Gly Ile Phe His Ala Thr Asn
195 200 205 

Ala Gly Ala Ala Thr Trp Tyr Glu Leu Ala Arg Glu Val Phe Arg Leu 
210 215 220 

Ala Gly Ala Asp Pro Glu Arg Val Arg Pro Val Ala Thr Ala Asp Arg 
225 230 235 240 

Pro Gly Pro Ala Pro Arg Pro Ala Cys Thr Val Leu Gly His Asp Arg 
245 250 255 

Trp Arg Leu Val Gly Val Ala Pro Pro Arg Asp Trp Arg Ala Ala Leu 
260 265 270 

Arg Glu Ala Met Arg Gln Leu Leu Pro Gly Gly Arg Leu Arg Asn Leu
275 280 285

Thr Gly Thr
290

<210> 9
<211> 350
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snogK, function: dTDP-glucose-4,6-dehydratase"

<400> 9

Met Ala Ser His Thr Ser Ala Thr Thr Asp Val Asn Ile Leu Val Thr
1 5 10 15

Gly Ala Val Gly Phe Ile Gly Ser Ala Tyr Val Arg Met Leu Leu Glu
20 25 30 

Asn Arg Ala Pro Gly Ala Gly Ala Pro Ala Val Arg Val Thr Val Leu 
35 40 45 

Asp Lys Leu Thr Tyr Ala Gly Asn Leu Thr Asn Leu Asp Ala Val Arg 
50 55 60 

Gly Asp Arg Leu Arg Phe Val Arg Gly Asp Ile Leu Asp Ala Glu Leu
65 70 75 80

Val Asp Glu Leu Met Ala His Ser Asp Gln Val Val His Phe Ala Ala
85 90 95

Glu Ser His Val Asp Arg Ser Ile Arg Ala Ala Asp Asp Phe Val Leu
100 105 110

Thr Asn Val Val Gly Thr Gln Arg Leu Leu Asp Ala Ala Leu Arg His
115 120 125

Gly Val Glu Pro Phe Val Leu Val Ser Thr Asp Glu Val Tyr Gly Ser
130 135 140

Ile Ala Ser Gly Ser Trp Pro Glu Glu His Pro Leu Ser Pro Asn Ser
145 150 155 160

Pro Tyr Ala Ala Ser Lys Ala Ser Ala Asp Leu Met Ala Phe Ala Cys
165 170 175

His Arg Thr His Gly Leu Asp Val Arg Val Thr Arg Cys Ser Asn Asn
180 185 190

Tyr Gly Pro Arg Gln His Pro Glu Lys Leu Ile Pro Arg Phe Val Thr
195 200 205

Asn Leu Leu Asp Gly Leu Pro Val Pro Leu Tyr Gly Asp Gly Arg Asn
210 215 220

Val Arg Glu Trp Leu His Val Glu Asp His Cys Arg Gly Val Asp Leu
225 230 235 240

Val Arg Thr Ala Gly Arg Pro Gly Gly Val Tyr His Ile Gly Gly Gly
245 250 255

Arg Glu Leu Ser Asn Arg Glu Leu Val Gly Met Leu Leu Glu Leu Cys
260 265 270

Gly Ala Asp Trp Ser Ser Val Arg His Val Pro Asp Arg Lys Gly His
275 280 285

Asp Leu Arg Tyr Ser Leu Asp Trp Gly Arg Ala Arg Glu Glu Leu Gly
290 295 300

Tyr Arg Pro Ala Arg Glu Phe Ser Ser Gly Leu Arg Ser Thr Val Gln
305 310 315 320 

Trp Tyr Arg Glu Asn Arg Ser Trp Trp Glu Pro Leu Lys Arg Gly Val 
325 330 335 

Thr Ala Pro Gly Gly Thr Ser Thr Val Val Pro Gly Val Arg 
340 345 350

<210> 10
<211> 134
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snoaL, function: NAME cyclase"

<400> 10 

Met Val Ser Ala Phe Asn Thr Gly Arg Thr Asp Asp Val Asp Glu Tyr 
1 5 10 15

Ile His Pro Asp Tyr Leu Asn Pro Ala Thr Leu Glu His Gly Ile His
20 25 30

Thr Gly Pro Lys Ala Phe Ala Gln Leu Val Gly Trp Val Arg Ala Thr
35 40 45

Phe Ser Glu Glu Ala Arg Leu Glu Glu Val Arg Ile Glu Glu Arg Gly
50 55 60

Pro Trp Val Lys Ala Tyr Leu Val Leu Tyr Gly Arg His Val Gly Arg
65 70 75 80

Leu Val Gly Met Pro Pro Thr Asp Arg Arg Phe Ser Gly Glu Gln Val
85 90 95

His Leu Met Arg Ile Val Asp Gly Lys Ile Arg Asp His Arg Asp Trp
100 105 110

Pro Asp Phe Gln Gly Thr Leu Arg Gln Leu Gly Asp Pro Trp Pro Asp
115 120 125 

Asp Glu Gly Trp Arg Pro 
130 

<210> 11 
<211> 235 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snoK, function: unknown" 

<400> 11 

Met Pro Asp Pro Gly Gly Pro Thr Thr Ala Glu Asn Leu Ser Lys Glu 
1 5 10 15

Ala Val Arg Phe Tyr Arg Glu Gln Gly Tyr Val His Ile Pro Arg Val
20 25 30

Leu Ser Glu Thr Glu Val Thr Ala Phe Arg Ala Ala Cys Glu Glu Val
35 40 45

Leu Glu Lys Glu Gly Arg Glu Ile Ser Gly Ile Ala Leu Arg Leu Ala
50 55 60

Gly Ala Pro Leu Arg Val Tyr Ser Ser Asp Ile Leu Val Lys Glu Pro
65 70 75 80

Lys Arg Thr Leu Pro Thr Leu Val His Asp Asp Glu Thr Gly Leu Pro
85 90 95

Leu Asn Glu Leu Ser Ala Thr Leu Thr Ala Trp Ile Ala Leu Thr Asp
100 105 110

Val Pro Val Glu Arg Gly Cys Met Ser Tyr Val Pro Gly Ser His Leu
115 120 125

Arg Ala Arg Glu Asp Arg Gln Glu His Met Thr Ser Phe Ala Glu Phe
130 135 140

Arg Asp Leu Ala Asp Val Trp Pro Asp Tyr Pro Trp Gln Pro Arg Val
145 150 155 160

Ala Val Pro Val Arg Ala Gly Asp Val Val Phe His His Cys Arg Thr
165 170 175

Val His Met Ala Glu Ala Asn Thr Ser Asp Ser Val Arg Met Ala His
180 185 190

Gly Val Val Tyr Met Asp Ala Asp Ala Thr Tyr Arg Pro Gly Val Gln
195 200 205

Asp Gly His Leu Ser Arg Leu Ser Pro Gly Asp Pro Leu Glu Gly Glu
210 215 220

Leu Phe Pro Leu Val Thr Ala Gly Thr Arg Gln
225 230 235

<210> 12
<211> 390
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snogD, function: glycosyl transferase"

<400> 12

Met Arg Val Pro Gly Ser Cys Arg Thr Gly Gly Ile Met Arg Ala Leu
1 5 10 15

Phe Ile Thr Ser Pro Gly Leu Ser His Ile Leu Pro Thr Val Pro Leu
20 25 30

Ala Gln Ala Leu Arg Ala Leu Gly His Glu Val Arg Tyr Ala Thr Gly
35 40 45

Gly Asp Ile Arg Ala Val Ala Glu Ala Gly Leu Cys Ala Val Asp Val
50 55 60

Ser Pro Gly Val Asn Tyr Ala Lys Leu Phe Val Pro Asp Asp Thr Asp
65 70 75 80

Val Thr Asp Pro Met His Ser Glu Gly Leu Gly Glu Gly Phe Phe Ala
85 90 95

Glu Met Phe Ala Arg Val Ser Ala Val Ala Val Asp Gly Ala Leu Arg
100 105 110

Thr Ala Arg Ser Trp Arg Pro Asp Leu Val Val His Thr Pro Thr Gln
115 120 125 

Gly Ala Gly Pro Leu Thr Ala Ala Ala Leu Gln Leu Pro Cys Val Glu
130 135 140

Leu Pro Leu Gly Pro Ala Asp Ser Glu Pro Gly Leu Gly Ala Leu Ile
145 150 155 160

Arg Arg Ala Met Ser Lys Asp Tyr Glu Arg His Gly Val Thr Gly Glu
165 170 175

Pro Thr Gly Ser Val Arg Leu Thr Thr Thr Pro Pro Ser Val Glu Ala 
180 185 190 

Leu Leu Pro Glu Asp Arg Arg Ser Pro Gly Ala Trp Pro Met Arg Tyr 
195 200 205 

Val Pro Tyr Asn Gly Gly Ala Val Leu Pro Asp Trp Leu Pro Pro Ala 
210 215 220 

Ala Gly Arg Arg Arg Ile Ala Val Thr Leu Gly Ser Ile Asp Ala Leu
225 230 235 240

Ser Gly Gly Ile Ala Lys Leu Ala Pro Leu Phe Ser Glu Val Ala Asp
245 250 255

Val Asp Ala Glu Phe Val Leu Thr Leu Gly Gly Gly Asp Leu Ala Leu
260 265 270

Leu Gly Glu Leu Pro Ala Asn Val Pro Val Val Glu Trp Ile Pro Leu
275 280 285

Gly Ala Leu Leu Glu Thr Cys Asp Ala Ile Ile His His Gly Gly Ser
290 295 300

Gly Thr Leu Leu Thr Ala Leu Ala Ala Gly Val Pro Gln Cys Val Ile
305 310 315 320

Pro His Gly Ser Tyr Gln Asp Thr Asn Arg Asp Val Leu Thr Gly Leu
325 330 335

Gly Ile Gly Phe Asp Ala Glu Ala Gly Ser Leu Gly Ala Glu Gln Cys
340 345 350

Arg Arg Leu Leu Asp Asp Ala Gly Leu Arg Glu Ala Ala Leu Arg Val
355 360 365

Arg Gln Glu Met Ser Glu Met Pro Pro Pro Ala Glu Thr Ala Ala Lys
370 375 380 

Leu Val Ala Leu Ala Gly 
385 390

<210> 13
<211> 275
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snoVl, function: unknown"

<400> 13

Met Thr Val Leu Val Thr Gly Ala Thr Gly Asn Val Gly Arg His Val
1 5 10 15

Val Thr Gly Leu Leu Ala Ala Gly Arg Arg Val Arg Ala Leu Thr Arg
20 25 30

Thr Pro Asp Arg Ser Gly Leu Pro Gly Gly Ala Glu Ile Thr Gly Gly
35 40 45 

Asp Leu Thr Arg Pro Glu Thr Tyr Glu Arg Met Leu Asp Gly Val Glu 
50 55 60 

Ala Val Tyr Leu Phe Pro Val Pro Glu Thr Ala Ala Ala Phe Ala Gly 
65 70 75 80 

Ala Ala Arg Arg Ala Gly Val Arg Arg Ile Val Val Leu Ser Ser Asp
85 90 95

Ser Val Thr Asp Gly Thr Asp Thr Gly Gly His Arg Arg Val Glu Leu
100 105 110

Ala Val Glu Asp Thr Gly Leu Glu Trp Thr His Val Arg Pro Gly Glu
115 120 125

Phe Ala Leu Asn Lys Val Thr Leu Trp Ala Pro Ser Ile Arg Ala Glu
130 135 140

Gly Val Val Arg Ser Ala Tyr Pro Asp Ala Arg Val Ala Pro Val His
145 150 155 160

Glu Ala Asp Val Ala Ala Val Ala Val Thr Ala Leu Leu Lys Glu Gly
165 170 175

His Ala Gly Arg Ala Tyr Ser Val Thr Gly Pro Gln Ala Leu Thr Gln
180 185 190

Arg Glu Gln Val Arg Ala Val Gly Glu Gly Leu Gly Arg Ser Leu Ala
195 200 205

Phe Val Glu Val Thr Pro Gly Gln Ala Arg Ala Asp Leu Thr Ala Gln
210 215 220

Gly Leu Pro Ala Pro Ile Ala Asp Tyr Val Leu Ala Phe Gln Ala Gly
225 230 235 240

Trp Thr Glu Arg Pro Ala Pro Ala Arg Pro Thr Val Arg Glu Val Thr
245 250 255

Gly Arg Pro Ala Arg Thr Leu Ala Gln Trp Ala Ala Asp His Arg Ala
260 265 270

Asp Phe Arg 
275 

<210> 14 
<211> 424 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snogE, function: glycosyl transferase" 

<400> 14 

Val Arg Val Leu Leu Thr Ser Phe Ala Met Asp Ala His Phe Cys Thr 
1 5 10 15

Ala Val Pro Leu Ala Trp Ala Leu Arg Ser Ala Gly His Glu Val Arg
20 25 30

Val Ala Gly Gln Pro Ala Leu Thr Ser Thr Ile Thr Gly Ala Gly Leu
35 40 45

Thr Ala Val Pro Val Gly Arg Asp His Thr His Gly Ser Leu Leu Gly
50 55 60

Arg Val Gly Ser Asp Ile Leu Ala Leu His Asp Glu Ala Asp Tyr Leu
65 70 75 80

Glu Ala Arg His Asp Ala Leu Gly Phe Glu Phe Leu Lys Gly His Asn
85 90 95

Thr Val Met Ser Ala Leu Phe Tyr Ser Gln Ile Asn Asn Asp Ser Met
100 105 110

Val Asp Asp Leu Val Asp Phe Ala Arg His Trp Arg Pro Asp Leu Val
115 120 125

Val Trp Glu Pro Phe Thr Phe Ala Gly Ala Val Ala Ala Arg Ala Ser
130 135 140

Gly Ala Ala His Ala Arg Leu Leu Ser Phe Pro Asp Leu Phe Leu Ser
145 150 155 160

Thr Arg Arg Leu Phe Leu Glu Arg Met Ala Arg Gln Glu Pro Glu His
165 170 175

His Asp Asp Thr Leu Ala Glu Trp Leu Asp Trp Thr Leu Gly Arg His
180 185 190

Gly His Ser Phe Asp Glu Glu Ile Val Thr Gly Gln Trp Ser Ile Asp
195 200 205

Gln Thr Pro Ala Pro Val Arg Leu Asp Ala Gly Gly Pro Thr Val Pro
210 215 220

Met Arg Tyr Val Pro Tyr Ser Gly Leu Val Pro Thr Val Val Pro Asp
225 230 235 240

Trp Leu Arg Arg Pro Pro Glu Arg Pro Arg Val Leu Val Thr Leu Gly
245 250 255

Ile Thr Ser Arg Arg Val Lys Ser Phe Leu Ala Val Ser Val Asp Asp
260 265 270

Leu Phe Glu Ala Val Ala Gly Leu Gly Val Glu Val Val Ala Thr Leu
275 280 285

Asp Ala Asp Gln Arg Glu Leu Leu Gly Arg Val Pro Asp His Phe Arg
290 295 300

Ile Val Glu His Val Pro Leu Asp Ala Val Leu Pro Thr Cys Ser Ala
305 310 315 320

Ile Val His His Gly Gly Ala Gly Thr Trp Ser Thr Ala Ala Val Tyr
325 330 335

Gly Val Pro Gln Val Ser Leu Gly Ser Met Trp Asp His Phe Tyr Arg
340 345 350

Ala Arg Arg Leu Glu Glu Leu Gly Ala Gly Leu Arg Leu Pro Ser Gly
355 360 365

Glu Leu Thr Ala Glu Gly Leu Arg Thr Arg Leu Glu Arg Val Leu Gly
370 375 380

Glu Pro Ser Phe Gly Thr Ala Ala Gln Ala Leu Ser Asp Thr Ile Ala
385 390 395 400

Ala Glu Pro Ser Pro Ser Glu Val Val Pro Val Leu Glu Glu Leu Thr
405 410 415

Gly Arg His Arg Pro Gly Thr Arg
420 

<210> 15 
<211> 139 
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<223> "translate of snoL, function: unknown"

<400> 15

Met Ser Thr Thr Ala Asn Lys Glu Arg Cys Leu Glu Met Val Ala Ala
1 5 10 15

Trp Asn Arg Trp Asp Val Ser Gly Val Val Ala His Trp Ala Pro Asp
20 25 30

Val Val His Tyr Asp Asp Glu Asp Lys Pro Val Ser Ala Glu Glu Val 
35 40 45 

Val Arg Arg Met Asn Ser Ala Val Glu Ala Phe Pro Asp Leu Arg Leu
50 55 60

Asp Val Arg Ser Ile Val Gly Glu Gly Asp Arg Val Met Leu Arg Ile
65 70 75 80

Thr Cys Ser Ala Thr His Gln Gly Val Phe Met Gly Ile Ala Pro Thr
85 90 95

Gly Arg Lys Val Arg Trp Thr Tyr Leu Glu Glu Leu Arg Phe Ser Glu
100 105 110

Ala Gly Lys Val Val Glu His Trp Asp Val Phe Asn Phe Ser Pro Leu
115 120 125

Phe Arg Asp Leu Gly Val Val Pro Asp Gly Leu
130 135

<210> 16
<211> 155
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<223> "translate of snoO, function: homologous to mtmX of mithramycin cluster"

<400> 16 

Met Ser Val Arg Thr Asp Gln Thr Ala Ala Pro Glu Asp Arg Ala Ala
1 5 10 15

Ala Thr Asp Pro Gly Phe Gly His Leu Tyr Ala Gln Val Gln Gln Phe
20 25 30

Tyr Ala Arg Gln Met Gln Leu Leu Asp Ser Gly Ala Ala Glu Glu Trp
35 40 45

Ala Ala Thr Phe Thr Glu Asp Gly Thr Phe Ala Arg Pro Ser Ser Pro
50 55 60

Glu Pro Ala Arg Gly His Ala Glu Leu Ala Ala Gly Ala Arg Ala Ala
65 70 75 80

Ala Glu Arg Leu Ala Ala Glu Gly Leu Ser His Arg His Val Ile Gly
85 90 95

Met Thr Ala Val Arg Arg Glu Pro Asp Gly Ser Val Phe Val Arg Ser
100 105 110

Tyr Ala Gln Val Phe Ala Thr Arg Arg Gly Glu Ala Pro Arg Leu His
115 120 125

Leu Ile Cys Val Cys Glu Asp Val Leu Val Arg Glu Gly Pro Gly Leu
130 135 140

Lys Val Arg Glu Arg Val Val Thr His Asp Ala
145 150 155

<210> 17
<211> 281
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snoaF, function: C-7 ketoreductase"

<400> 17

Val Arg Ala Met Thr Asp Ser Thr Gly [illegible] Arg Pro Val Pro Ala Met
1 5 10 15

Ser Pro Ala Pro Ser Pro Thr Pro Ser Pro Gly Pro Ala Pro Gly Ser
20 25 30

Glu Pro Ala Pro Leu Ala Val Ile Val Thr Gly Gly Gly Ser Gly Ile
35 40 45

Gly Arg Ala Thr Ala Arg Ala Phe Ala Ala Gln Gly Ala Lys Val Leu
50 55 60

Val Val Gly Arg Thr Glu Asp Ala Leu Ala Gln Thr Ala Glu Gly Cys
65 70 75 80

Ala Asp Met Arg Val Leu Val Ala Asp Val Ala Ser Pro Asp Gly Pro
85 90 95

Gln Ala Val Val Asn Ala Ala Leu Arg Glu Phe Gly Arg Ile Asp Val
100 105 110

Leu Val Asn Asn Ala Ala Val Ala Gly Met Glu Thr Leu Gln Thr Val
115 120 125

Asp Arg Asp Ala Val Ala Arg Gln Phe Gly Thr Asn Leu Thr Ala Pro
130 135 140

Leu Phe Leu Val Gln Ser Ala Leu Gly Ala Leu Glu Lys Ser Arg Gly
145 150 155 160

Ile Val Val Asn Val Gly Thr Ala Ala Thr Leu Gly Leu Arg Ala Ala
165 170 175

Pro Thr Gly Ala Leu Tyr Gly Ala Ser Lys Val Ala Leu Asp Tyr Leu
180 185 190

Thr Arg Thr Trp Ala Val Glu Leu Ala Pro Arg Gly Ile Arg Val Val
195 200 205

Gly Val Ala Pro Gly Val Ile Asp Thr Gly Ile Gly Val Arg Met Gly
210 215 220

Met Thr Pro Glu Gly Tyr Arg Glu Phe Leu Thr Gly Met Gly Gly Arg
225 230 235 240

Val Pro Val Gly Arg Val Gly Arg Pro Glu Asp Val Ala Trp Trp Ile
245 250 255

Val Gln Leu Ala Arg Pro Glu Ala Gly Tyr Ala Thr Gly Met Val Val
260 265 270

Pro Val Asp Gly Gly Leu Ser Leu Val
275 280

<210> 18
<211> 190
<212> PRT

<213> Streptomyces nogalater ATCC 27451

<220>

<223> "translate of snoN, function: unknown"

<400> 18

Val Gln Glu Thr Glu Pro Gly Val Pro Ala Asp Leu Pro Ala Glu Ser
1 5 10 15

Asp Pro Ala Ala Leu Glu Arg Leu Ala Ala Arg Tyr Arg Arg Asp Gly
20 25 30

Tyr Val His Val Pro Gly Val Leu Asp Ala Gly Glu Val Ala Glu Tyr
35 40 45

Leu Ala Glu Ala Arg Arg Leu Leu Ala His Glu Glu Ser Val Arg Trp
50 55 60

Gly Ser Gly Ala Gly Thr Val Met Asp Tyr Val Ala Asp Ala Gln Leu
65 70 75 80

Gly Ser Asp Thr Met Arg Arg Leu Ala Thr His Pro Arg Ile Ala Ala
85 90 95

Leu Ala Glu Tyr Leu Ala Gly Ser Pro Leu Arg Leu Phe Lys Leu Glu
100 105 110

Val Leu Leu Lys Glu Asn Lys Glu Lys Asp Ala Ser Val Pro Thr Ala
115 120 125

Pro His His Asp Ala Phe Ala Phe Pro Phe Ser Thr Ala Gly Thr Ala
130 135 140

Leu Thr Ala Trp Val Ala Leu Val Asp Val Pro Val Glu Arg Gly Cys
145 150 155 160

Met Thr Phe Val Pro Gly Ser His Leu Leu Pro Asp Pro Asp Thr Gly
165 170 175

Asp Glu Pro Trp Ala Gly Ala Phe Thr Arg Pro Gly Glu Ile
180 185 190







